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PREFACE 


This  volume  is  part  of  a  five-volume  set  that  summarizes  the  research  of  participants  in  the  1992 
AFOSR  Summer  Research  Extension  Program  (SREP).  The  current  volume,  Volume  3  of  5, 
presents  the  fmal  reports  of  SREP  participants  at  Rome  Laboratory. 

Reports  presented  in  this  volume  are  arranged  alphabetically  by  author  and  are  numbered 
consecutively  -  e.g.,  1-1,  1-2,  1-3;  2-1,  2-2,  2-3,  with  each  series  of  reports  preceded  by  a 
22-page  management  summary.  Reports  in  the  five-volume  set  are  organized  as  follows: 

VOLUME  TITLE 

lA  Armstrong  Laboratory  (part  one) 

IB  Armstrong  Laboratory  (part  two) 

2  Phillips  Laboratory 

3  Rome  Laboratory 

4 A  Wright  Laboratory  (part  one) 

4B  Wright  Laboratory  (part  two) 

5  Air  Force  Civil  Engineering  Laboratory,  Arnold  Engineering  Development  Center, 
Frank  J.  Seller  Research  Laboratory,  Wilford  Hall  Medical  Center 
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1992  SUMMER  RESEARCH  EXTENSION  PROGRAM  FINAL  REPORTS 


1992  Summer  Research  Extension  Program  Management  Rqxjrt  .  .  . 

Rome  Laboratory 

R^rart 

Number  Retx)rt  Title 

1  Parallelism  in  User  Assisted  Information  Extraction 

2  Noise  Modeling  of  Millimeter- Wave  Field  Effect  Transistors 

3  Novel  Digital  Speech  Compression  Techniques 

4  A  Framework  for  Adaptive  Fault  Management  in  Survivable  C^I  Systems 

5  Waveguides  in  Glass  Substrates,  Waveguide  Lasers,  and  Tunneling  Electro- 
Optic  AND  Gates 

6  Defect  Properties  of  Photorefractive  and  Non-Photorefractive  BSO 

7  Automatic  Construction  of  a  Semantic  Lexicon  for  Use  in  Natural  Language 
Processing  Systems 

8  Optical  Communication  System  Based  on  Tunable  Doped  Fiber  Oscillator 

9  FDTD  Analysis  of  the  Radiation  Properties  of  a  Parabolic  Cylinder  Illuminated 
by  a  Very  Short  Pulse 

10  C‘  Approximation  Techniques  for  Investigating  and  Training  Neural  Networks 

11  A  Fiber  Oscillator  for  Optical  Communications 

12  Simulation  Analysis  of  a  Local  Area  Network  for  XP  at  Rome  Laboratories 


ii 
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1992  SUMMER  RESEARCH  EXTENSION  PROGRAM  (SREP)  MANAGEMENT  REPORT 

1.0  BACKGROUND 

Under  the  provisions  of  Air  Force  Office  of  Scientific  Research  (AFOSR)  contract  F49620-90-C- 
0076,  September  1990,  Research  &  Development  Laboratories  (RDL),  an  8(a)  contractor  in 
Culver  City,  CA,  manages  AFOSR's  Summer  Research  Program.  This  report  is  issued  in  partial 
fulfillment  of  that  contract  (CLIN  0003 AC). 

The  name  of  this  program  was  changed  during  this  year's  period  of  performance.  For  that 
reason,  participants'  cover  sheets  are  captioned  "Research  Initiation  Program"  (RIP),  while  the 
covers  of  the  comprehensive  volumes  are  titled  "Summer  Research  Extension  Program"  (SREP). 
The  program's  sponsor,  the  Air  Force  Office  of  Scientific  Research  (AFOSR),  changed  the  name 
to  differentiate  this  program  from  another  which  also  bore  its  original  name. 

Apart  from  this  name  change,  however,  the  program  remained  as  it  has  been  since  its  initiation 
as  the  Mini-Grant  Program  in  1983.  The  SREP  is  one  of  four  programs  AFOSR  manages  under 
the  Summer  Research  Program.  The  Summer  Faculty  Research  Program  (SFRP)  and  the 
Graduate  Student  Research  Program  (GSRP)  place  college-level  research  associates  in  Air  Force 
research  laboratories  around  the  United  States  for  8  to  12  weeks  of  research  with  Air  Force 
scientists.  The  High  School  Apprenticeship  Program  (HSAP)  is  the  fourth  element  of  the 
Summer  Research  Program,  allowing  promising  mathematics  and  science  students  to  spend  two 
months  of  their  summer  vacations  at  Air  Force  laboratories  within  commuting  distance  from  their 
homes. 

SFRP  associates  and  exceptional  GSRP  associates  are  encouraged,  at  the  end  of  their  summer 
tours,  to  write  proposals  to  extend  their  summer  research  during  the  following  calendar  year  at 
their  home  institutions.  AFOSR  provides  funds  adequate  to  pay  for  75  SREP  subcontracts.  In 
addition,  AFOSR  has  traditionally  provided  further  funding,  when  available,  to  pay  for  additional 
SREP  proposals,  including  those  submitted  by  associates  from  Historically  Black  Colleges  and 
Universities  (HBCUs)  and  Minority  Institutions  (Mis).  Finally,  laboratories  may  transfer  internal 
funds  to  AFOSR  to  fund  additional  SREPs.  Ultimately  the  laboratories  inform  RDL  of  their 
SREP  choices,  RDL  gets  AFOSR  approval,  and  RDL  forwards  a  subcontract  to  the  institution 
where  the  SREP  associate  is  employed.  The  subcontract  (see  Attachment  1  for  a  sample)  cites 
the  SREP  associate  as  the  principal  investigator  and  requires  submission  of  a  report  at  the  end 
of  the  subcontract  period. 

Institutions  are  encouraged  to  share  costs  of  the  SREP  research,  and  many  do  so.  The  most 
common  cost-sharing  arrangement  is  reduction  in  the  overhead,  fringes,  or  administrative 
changes  institutions  would  normally  add  on  to  the  principal  investigator's  or  research  associate's 
labor.  Some  institutions  also  provide  other  support  (e.g.,  computer  run  time,  administrative 
assistance,  facilities  and  equipment  or  research  assistants)  at  reduced  or  no  cost. 

When  RDL  receives  the  signed  subcontract,  we  fund  the  effort  initially  by  providing  90%  of  the 
subcontract  amount  to  the  institution  (normally  $18,000  for  a  $20,000  SREP).  When  we  receive 
the  end-of-research  report,  we  evaluate  it  administratively  and  send  a  copy  to  the  laboratory  for 
a  technical  evaluation.  When  the  laboratory  notifies  us  the  SREP  report  is  acceptable,  we  release 
the  remaining  funds  to  the  institution. 
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2.0  THE  1992  SREP  PROGRAM 


SELECTION  DATA:  In  the  summer  of  1991,  170  faculty  members  (SFRP  associates)  and  142 
graduate  students  (GSRP  associates)  participated  in  the  summer  program.  Of  those,  147  SFRPs 
and  10  GSRPs  submitted  SREP  proposals;  88  SFRP  SREPs  and  7  GSRP  SREPs  were  selected 
for  funding  (total:  95). 


Summer 

1991 

Participants 

Submitted 

SREP 

Proposals 

SREPs 

Funded 

SFRP 

170 

147 

88 

GSRP 

142 

10 

7 

The  funding  was  provided  as  follows: 


Contractual  slots  funded  by  AFOSR  75 

Laboratory- funded  13 

Additional  funding  from  AFOSR  J 

Total  95 


Seven  HBCU/MI  associates  from  the  1991  summer  program  submitted  SREP  proposals;  five 
were  selected  (one  was  lab-funded;  four  were  funded  by  additional  AFOSR  funds). 

By  laboratory,  the  applications  submitted  and  selected  show  in  the  following  table: 


Applied 

Selected 

Air  Force  Civil  Engineering  Laboratory 

6 

4 

Armstrong  Laboratory 

34 

20 

Arnold  Engineering  Development  Center 

12 

2 

Frank  J.  Seiler  Research  Laboratory 

5 

3 

Phillips  Laboratory 

30 

18 

Rome  Laboratory 

16 

11 

Wilford  Hall  Medical  Center 

1 

1 

Wright  Laboratory 

53 

36 

TOTAL 

157 

95 

Note:  Phillips  Laboratory  funded  2  SREPs;  Wright  Laboratory  funded  11;  and  AFOSR 
funded  7  beyond  its  contractual  75. 
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ADMINISTRATIVE  EVALUATION:  The  administrative  quality  of  the  SREP  associates'  final 
reports  was  satisfactory.  Most  complied  with  the  formatting  and  other  instructions  RDL  provided 
to  them.  In  the  final  days  of  December  1992  and  in  the  first  two  months  of  1993,  several 
associates  called  and  requested  no-cost  extensions  of  up  to  six  months.  After  consultation  with 
our  AFOSR  Contracting  Officer's  Representative,  RDL  approved  the  requests  but  asked  that  all 
such  associates  provide  an  interim  report  to  be  included  in  this  volume.  That  caused  an  AFOSR- 
approved  delay  beyond  the  1  April  1993  submission  of  this  report.  The  subcontracts  were 
funded  by  $1,893,616  of  Air  Force  money.  Institutions'  cost  sharing  amounted  to  $948,686. 

TECHNICAL  EVALUATION:  The  form  we  used  to  gather  data  for  technical  evaluation  and 
the  technical  evaluations  of  the  SREP  reports  are  provided  as  Attachment  2.  This  summary 
evaluation  is  shown  by  SREP  number.  The  average  rating  range  was  from  3.1  to  5.0.  The 
overall  average  for  those  evaluated  was  4.6  out  of  5.00.  The  three  rating  factors  with  the  highest 
average  scores  were: 

o  The  USAF  should  continue  to  pursue  the  research  in  this  RIP  report, 
o  The  money  spent  on  this  RIP  report  was  well  worth  it. 
o  I'll  be  eager  to  be  a  focal  point  for  summer  and  RIP  associates  in  the  future. 

Thus  it  is  clear  that  the  laboratories  place  a  high  value  on  AFOSR' s  Summer  Research  Program: 
SFRP,  GSRP,  and  SREP. 


3.0  SUBCONTRACTS  SUMMARY 

Table  1  lists  contractually  required  information  on  each  SREP  subcontract.  The  individual 
reports  are  published  in  volumes  as  follows: 


Laboratory  Volume 


Air  Force  Civil  Engineering  Laboratory  5 

Armstrong  Laboratory  1 

Arnold  Engineering  Development  Center  5 

Frank  J.  Seiler  Research  Laboratory  5 

Phillips  Laboratory  2 

Rome  Laboratory  3 

Wilford  Hall  M^ical  Center  5 

Wright  Laboratory  4 


INTRODUCTION  -  3 


TABLE  1:  SUBCONTRACTS  SUMMARY 


Researcher's  name 

Institution 

Location 


Abbott,  Ben  A 
Vanderbilt  University 
Nashville,  TN  37235 

Acharya ,  Raj 

State  University  of  New  York,  Buffalo 
Buffalo,  NY  14260 

Adams,  Christopher  M 
Oklahoma  State  University 
Stillwater,  OK  74078 

Anderson,  Richard  A 
University  of  Missouri,  Rolla 
Rolla,  MO  65401 

Arora,  Vi jay  K 
Wilkes  University 
Wilkes-Barre,  PA  18766 

Ball,  William  P 
Duke  University 
Durham,  NC  27706 

Battles,  Frank  P 
Massachusetts  Maritime  Academy 
Buzzard's  Bay,  MA  025321803 

Bieniek,  Ronald  J 
University  of  Missouri,  Rolla 
Rolla,  MO  65401 

Blystone,  Robert  V 
Trinity  University 
San  Antonio,  TX  78212 

Cha,  Soyoung  S 

University  of  Illinois,  Chicago 
Chicago,  IL  60680 

Chandra,  D.  V.  Satish 
Kansas  State  University 
Manhattan,  KS  66506 

Chenette,  Eugene  R 
University  of  Florida 
Gainesville,  FL  32611 

Christensen,  Douglas  A 

University  of  Utah 

Salt  Lake  City,  UT  84112 


Highest  Subcontract 
Degree  Number  Duration 

Department 

Amount  Sharing 


MS  135  01/01/92-12/31/92 

Dept  of  Electrical  Engineering 
19966.00  0.00 

PhD  151  01/01/92-12/31/92 

Dept  of  Electrical  &  Comp  Engrg 
20000.00  0.00 

PhD  68  01/01/92-12/31/92 

Dept  of  Chemistry 
20000.00  0.00 

PhD  50  01/01/92-12/31/92 

Dept  of  Physics 

20000.00  5000.00 

PhD  3  10/01/91-09/30/92 

Dept  of  Electrical  &  Comp  Engrg 
19996.00  36208.00 

PhD  71  01/01/92-12/31/92 

Dept  of  Civil  &  Environmental  Eng 
20000.00  26747.00 

PhD  152  01/01/92-12/31/92 

Dept  of  Basic  Sciences 
20000.00  22000.00 

PhD  147  01/01/92-12/31/92 

Dept  of  Physics 

19945.00  4000.00 

PhD  127  01/01/92-12/31/92 

Dept  of  Biology 

20000.00  14783.00 

PhD  011  01/01/92-12/31/92 

Dept  of  Mechanical  Engineering 
20000.00  3842.00 

PhD  89  01/18/92-10/17/92 

Dept  of  Electrical  Engineering 
20000.00  11170.00 

PhD  106  01/01/92-12/31/92 

Dept  of  Electrical  Engineering 
20000.00  0.00 

PhD  83  01/01/92-12/31/92 

Dept  of  Electrical  Engineering 
19999.00  5000.00 
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Chubb,  Gerald  P 
Ohio  State  University 
Columbus,  OH  43235 

Courter,  Robert  W 
Louisiana  State  University 
Baton  Rouge,  LA  70803 

Dey,  Pradip  P 
Hampton  University 
Hampton,  VA  23668 

Draut ,  Arthur  W 

Embry  Riddle  Aeronautical  University 
Prescott,  AZ  86301 

Dreisbach,  Joseph 
University  of  Scranton 
Scranton,  PA  185104626 

Dror,  Itiel 
Harvard  University 
Cambridge,  MA  02138 

Drost -Hansen,  W. 

University  of  Miami 
Coral  Gables,  FL  33124 

Dunleavy,  Lawrence  P 
University  of  South  Florida 
Tampa,  FL  33620 

Evans ,  Joseph  B 
University  of  Keinsas 
Lawrence,  KS  66045 

Flowers ,  George  T 
Auburn  University 
Auburn,  AL  368495341 

Gantenbein,  Rex  E 
University  of  Wyoming 
Laramie,  WY  82071 

Garcia,  Ephrarim 
Vanderbilt  University 
Nashville,  TN  37235 

German,  Fred  J 

Auburn  University 

Auburn  University,  AL  36830 

Gould,  Richard  D 

North  Carolina  State  University 

Raleigh,  NC  276957910 

Gove ,  Randy  L 

University  of  Alabama,  Huntsville 
Huntsville,  AL  35899 

Grabowski ,  Marek 

University  of  Colorado,  Colorado  Springs 
Colorado  Springs,  CO  809337150 


PhD  26  01/01/92-12/31/92 

Dept  of  Aviation 

20000.00  .  7600.00 

PhD  8  10/01/91-09/30/92 

Dept  of  Mechanical  Engineering 
20000.00  445.00 

PhD  120  01/01/92-12/31/92 

Computer  Science  Department 
19921.00  0.00 

PhD  133  01/06/92-05/08/92 

Computer  Science  Dept 
19431.00  0.00 

PhD  108  12/01/91-12/01/92 

Dept  of  Chemistry 

20000.00  4000.00 

BS  76  01/01/92-12/31/92 

Dept  of  Psychology 

20000.00  0.00 

PhD  124  12/01/91-12/01/92 

Dept  of  Chemistry 

20000.00  12000.00 

PhD  41  01/01/92-12/31/92 

Dept  of  Electrical  Engineering 
20000.00  6463.00 

PhD  96  01/01/92-12/31/92 

Dept  of  Electrical  &  Comp  Engrg 

20000.00  0.00 

PhD  73  01/01/92-12/30/92 

Dept  of  Mechanical  Engineering 
19986.00  12121.00 

PhD  22  01/01/91-12/31/92 

Dept  of  Computer  Science 

20000.00  26643.00 

PhD  32  12/01/91-11/30/92 

Dept  of  Mechanical  Engineering 
20000.00  9659.00 

PhD  49  01/01/92-12/31/92 

Dept  of  Electrical  Engineering 
20000.00  0.00 

PhD  87  01/01/92-12/31/92 

Dept  of  Mech  and  Aerospace  Engrg 
20000.00  14424.00 

MS  122  01/01/92-12/31/92 

Dept  of  Physics 

20000.00  3469.00 

PhD  92  01/01/92-12/31/92 

Dept  of  Physics 

197.00.00  0.00 
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Gunaratne ,  Man j  riker 
University  of  South  Florida 
Tampa,  FL  33620 

Hall,  Ernest  L 
University  of  Cincinnati 
Cincinnati,  OH  452210072 

Hamilton,  William  L 
Salem  State  College 
Salem,  MA  01970 

Hamilton,  Kirk  L 

Xavier  University  of  Louisiana 

New  Orleans,  LA  70125 

Harris,  Harold  H 

University  of  Missouri,  St. Louis 

St.  Louis,  MO  63121 

Hartung,  George  H 
University  of  Hawaii 
Honolulu,  HI  96822 

Hatfield,  Steven  L 
University  of  Kentucky 
Lexington,  KY  40506 

Hedman,  Paul  O'Dell 
Brigham  Yovmg  University 
Provo,  UT  84602 

Heister,  Stephen  D 

Purdue  University 

West  Lafayette,  IN  47907 

Hess,  David  J 

University  of  Texas,  Austin 
Austin,  TX  78713 

Hoffman,  R.  W 

Case  Western  Reserve  University 
Cleveland,  OH  44106 

Huerta,  Manuel  A 
University  of  Miami 
Coral  Gables,  FL  33124 

Hui ,  David 

University  of  New  Orleans 
New  Orleans,  LA  70148 

Iyer,  Ashok 

University  of  Nevada,  Las  Vegas 
Las  Vegas,  NV  89154 

Khonsari,  Michael  M 
University  of  Pittsburgh 
Pittsburgh,  PA  15260 

Kibert,  Charles  J 
University  of  Florida 
Gainesville,  FL  32611 


PhD  90  01/01/92-12/31/92 

Dept  of  Civil  Engrg  &  Mechanics 
19994.00  10062.00 

PhD  134  01/01/92-12/31/92 

Dept  of  Robotics  Research 
19975.00  0.00 

PhD  47  01/01/92-12/31/92 

Dept  of  Geography 

20000.00  32000.00 

PhD  57  01/01/92-12/31/92 

Dept  of  Biology 
20000.00  16100.00 

PhD  94  01/01/92-12/31/92 

Dept  of  Chemistry 

19300.00  8600.00 

PhD  46  01/01/92-12/31/92 

Dept  of  Physiology 

20000.00  7530.00 

BS  23  01/01/92-12/31/92 

Dept  of  Materials  Science  &  Engrg 
20000.00  28625.00 

PhD  17  01/01/92-12/31/92 

Dept  of  Chemical  Engineering 
19999.00  6928.00 

PhD  5  01/01/92-12/31/92 

School  of  Aero  &  Astronautics 
20000.00  4419.00 

BA  149  01/01/92-12/31/92 

Dept  of  Psychology 

19914.00  8784.00 

PhD  99  01/01/92-12/31/92 

Dept  of  Physics 

19770.00  0.00 

PhD  62  01/01/92-12/31/92 

Dept  of  Physics 

20000.00  1207.00 

PhD  116  01/01/92-12/31/92 

Dept  of  Mechanical  Engineering 
20000.00  0.00 

PhD  74  01/01/92-12/31/92 

Dept  of  Electrical  &  Comp  Engrg 
20000.00  18549.00 

PhD  53  01/01/92-12/31/92 

Dept  of  Mechanical  Engineering 
20000.00  32958.00 

PhD  2  01/01/92-12/31/92 

Dept  of  Fire  Testing  &  Research 
20000.00  6928.00 
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Klarup,  Douglas  G 
University  of  Montana 
Missoula,  MT  59812 

Koblasz,  Arthur  J 

Georgia  Institute  of  Technology 

Atlanta,  GA  30332 

Kornreich,  Philipp 
Syracuse  University 
Syracuse,  NY  13244 

Kuo,  Spencer  P 
Polytechnic  University 
Farmingdale,  NY  11735 

Langhoff,  Peter  W 
Indiana  University 
Bloomington,  IN  47402 

Lee,  Byung-Lip 

Pennsylvania  State  University 
University  Park,  PA  16802 

Leigh,  Wallace  B 
Alfred  University 
Alfred,  NY  14802 

Liddy,  Elizcibeth 
Syracuse  University 
Syracuse,  NY  132444100 

Liu,  Cheng 

University  of  North  Carolina,  Charlotte 
Charlotte,  NC  28270 

Main,  Robert  G 

California  State  University,  Chico 
Chico,  CA  959290504 

Mains,  Gilbert  J 
Oklahoma  State  University 
Stillwater,  OK  74078 

Marathay,  Arvind  S 
University  of  Arizona 
Tucson,  AZ  85721 

Martin,  Charlesworth  R 
Norfolk  State  University 
Norfolk,  VA  23504 

Mayes,  Jessica  L 
University  of  Kentucky 
Lexington,  KY  405034203 

Mulligan,  Benjamin  E 
University  of  Georgia 
Athens,  GA  30602 

Munday,  Edgar  G 

University  of  North  Carolina,  Charlotte 
Charlotte,  NC  28223 


PhD  84  01/01/92-12/31/92 

Dept  of  Chemistry 
20000.00  0.00 

PhD  145  01/01/92-09/30/92 

Dept  of  Civil  Engineering 
19956.00  0.00 

PhD  35  10/01/91-09/30/92 

Dept  of  Electrical  &  Comp  Engrg 
20000.00  0.00 

PhD  59  01/01/92-12/31/92 

Dept  of  Electrical  Engineering 
20000.00  9916.00 

PhD  115  01/01/92-12/31/92 

Dept  of  Chemistry 

20000.00  35407.00 

PhD  93  01/01/92-12/31/92 

Dept  of  Engrg  Science  &  Mechanics 
20000.00  8173.00 

PhD  118  01/01/92-12/31/92 

Dept  of  Electrical  Engineering 
19767.00  18770.00 

PhD  104  01/01/92-12/31/92 

Dept  of  Information  Studies 
20000.00  0.00 

PhD  6  11/01/99-12/31/92 

Dept  of  Engineering  Technology 
20000.00  0.00 

PhD  28  01/01/92-06/30/92 

Dept  of  Communication  Design 
20000.00  7672.00 

PhD  52  01/01/92-12/31/92 

Dept  of  Chemistry 

19071.00  8746.00 

PhD  51  01/01/92-12/31/92 

Dept  of  Optical  Sciences 
20000.00  0.00 

PhD  125  01/01/92-12/31/92 

Dept  of  Physics  &  Engineering 
20000.00  0.00 

BS  16  01/01/92-12/31/92 

Dept  of  Material  Science  &  Engrng 
20000.00  28625.00 

PhD  54  01/01/92-12/31/92 

Dept  of  Psychology 

19895.00  13677.00 

PhD  38  10/01/91-10/30/92 

Dept  of  Mechanical  Engineering 
20000.00  11638.00 
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Nurre,  Joseph  H 
Ohio  University 
Athens,  OH  45701 

Orkwis,  Paul  D 
University  of  Cincinnati 
Cincinnati,  OH  452210070 

Patra,  Amit  L 
University  of  Puerto  Rico 
Mayaquez,  PR  00681 

Peters  II,  Richard  A 
Vanderbilt  University 
Nashville,  TN  37235 

Pollack,  Steven  K 
University  of  Cincinnati 
Cincinnati,  OH  452200012 

Prescott,  Glenn  E 
University  of  Kansas 
Lawrence,  KS  66045 

Price,  James  L 
University  of  Iowa 
Iowa  City,  lA  52242 

Qazi,  Salahuddin 
SUNY,  Utica 
Utica,  NY  13504 

Rappaport ,  Carey  M 
Northeastern  University 
Boston,  MA  02115 

Raws on,  Jenny  L 

North  Dakota  State  University 

Fargo,  ND  58105 

Riccio,  Gary  E 

University  of  Illinois,  Urbana 
Urbana,  IL  61821 

Rotz,  Christopher  A 
Brigham  Young  University 
Provo,  UT  84602 

S  chwar t  z ,  Mart in 
University  of  North  Texas 
Denton,  TX  762035068 

Senseman,  David  M 

University  of  Texas,  San  Antonio 

San  Antonio,  TX  78285 

Sensiper,  Martin 
University  of  Central  Florida 
Orlando,  FL  32816 

Shamma,  Jeff  S 
University  of  Texas,  Austin 
Austin,  TX  78713 


PhD  56  01/01/92-12/31/92 

Dept  of  Electrical  &  Comp  Engrg 
19842.00  15135.00 

PhD  14  10/01/91-10/30/92 

Dept  of  Engineering  Mechanics 

19966.00  23017.00 

PhD  69  01/01/92-12/31/92 

Dept  of  General  Engineering 
20000.00  2750.00 

PhD  160  01/01/92-12/31/92 

Dept  of  Electrical  Engineering 
20000.00  0.00 

PhD  31  01/01/92-12/31/92 

Dept  of  Materials  Sci  &  Engrg 
20000.00  14877.00 

PhD  72  01/01/92-12/31/92 

Dept  of  Electrical  Engineering 
20000.00  8000.00 

PhD  48  01/01/92-12/30/92 

Dept  of  Sociology 

20000.00  8600.00 

PhD  129  01/01/92-12/31/92 

Dept  of  Electrical  Engineering 
20000.00  25000.00 

PhD  58  01/01/92-06/30/92 

Dept  of  Electrical  &  Comp  Engmg 
19999.00  0.00 

PhD  144  01/01/92-12/31/92 

Dept  of  Electrical  Engineering 
19997.00  19826.00 

PhD  80  01/01/92-12/31/92 

Dept  of  Human  Perception 
20000.00  0.00 

PhD  136  12/01/91-12/31/92 

Dept  of  Manufacturing  Engineering 
20000.00  11814.00 

PhD  55  01/01/92-12/31/92 

Dept  of  Chemistry 

20000.00  18918.00 

PhD  77  12/01/91-11/30/92 

Dept  of  Information 

20000.00  19935.00 

BS  15  11/01/91-05/31/92 

Dept  of  Electrical  Engineering 
20000.00  0.00 

PhD  70  01/01/92-12/31/92 

Dept  of  Electrical  Engineering 
20000.00  0.00 
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Shively,  Jon  H 

California  State  University,  Northridge 
Northridge,  CA  91330 

Singh,  Sahjendra  N 
University  of  Nevada,  Las  Vegas 
Las  Vegas,  NV  89014 

Smith,  Gerald  A 
Pennsylvania  State  University 
University  Park,  PA  16802 

Stephens,  Benjamin  R 
Clemson  University 
Clemson,  SC  29634 

Sudkamp ,  Thomas 
Wright  State  University 
Dayton,  OH  45435 

Sydor,  Michael 

University  of  Minnesota,  Duluth 
Duluth,  MN  55804 

Tankin,  Richard  S 
Northwestern  University 
Evanston,  IL  60208 

Taylor,  Michael  D 
University  of  Central  Florida 
Orlando,  FL  32816 

Teegarden,  Kenneth  J 
University  of  Rochester 
Rochester,  NY  14627 

Tew,  Jeffrey  D 

Virginia  Polytech  Instit  cind  State  Univ 
Blacksburg,  VA  24061 

Tipping,  Richard  H 
University  of  Alabama 
Tuscaloosa,  AL  35487 

Tripathi ,  Ram  C 

University  of  Texas,  Scin  Antonio 
San  Antonio,  TX  78249 

Wells,  Fred  V 

Idciho  State  University 

Pocatello,  ID  83209 

Whitefield,  Phillip  D 
University  of  Missouri,  Rolla 
Rolla,  MO  65401 

Wolfenstine,  Jeffrey  B 
University  California,  Irvine 
Irvine,  CA  92717 

Wolper,  James  S 
Idaho  State  University 
Pocatello,  ID  83209 


PhD  140  01/01/92-12/31/92 

Dept  of  CIAM 

20000.00  14553.00 

PhD  79  01/01/92-12/31/92 

Dept  of  Electrical  Engineering 
20000.00  20595.00 

PhD  63  07/01/92-07/01/93 

Dept  of  Physics 

20000.00  0.00 

PhD  114  01/01/92-12/31/92 

Dept  of  Psycology 

19988.00  4250.00 

PhD  97  01/01/92-08/31/92 

Dept  of  Computer  Science 
20000.00  18739.00 

PhD  11  01/01/92-12/31/92 

Dept  of  Physics 

20000.00  0.00 

PhD  44  01/01/92-12/31/92 

Dept  of  Mechanical  Engineering 
20000.00  29103.00 

PhD  141  05/01/92-07/31/92 

Dept  of  Mathematics 

20000.00  1587.00 

PhD  98  01/01/92-12/31/92 

Dept  of  Optics 

20250.00  60600.00 

PhD  137  03/01/92-09/30/92 

Dept  of  Industrial  Engineering 
17008.00  4564.00 

PhD  81  01/01/92-05/31/92 

Dept  of  Physics  &  Astronomy 
20000.00  15000.00 

PhD  105  01/01/92-12/31/92 

Dept  of  Mathematics 

20000.00  2274.00 

PhD  155  01/01/92-12/31/92 

Dept  of  Chemistry 

20000.00  8000.00 

PhD  25  01/01/92-12/31/92 

Dept  of  Chemistry 

19991.00  25448.00 

PhD  18  01/01/92-12/31/92 

Dept  of  Mechanical  Engineering 
20000.00  11485.00 

PhD  138  01/15/92-09/30/92 

Dept  of  Mathematics 

20000.00  4828.00 
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Zavodney,  Lawrence  D 
Ohio  State  University 
Columbus,  OH  43210 

Zimmerman,  Wayne  J 
Texas  Women  University 
Denton,  TX  76204 


PhD  148 

Dept  of  Engineering 
20000.00 

PhD  111 

Dept  of  Mathematics 

19990.00 


01/01/92-12/31/92 

Mechanics 

0.00 

01/01/92-12/31/92 

8900.00 
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ATTACHMENT  1: 
SAMPLE  SREP  SUBCONTRACT 


AIR  FORCE  OFFICE  OF  SCIENTIFIC  RESEARCH 
1993  SUMMER  RESEARCH  EXTENSION  PROGRAM  SUBCONTRACT  93-36 


BETWEEN 


Research  &  Development  Laboratories 
5800  Uplander  Way 
Culver  City,  CA  90230-6608 


AMD 


University  of  Delaware 
Sponsored  Programs  Admin. 
Newark,  DE  19716 


REFERENCE:  Summer  Research  Extension  Program  Proposal  93-36 

Start  Date:  01/01/93  End  Date:  12/31/93 
Proposal  amount:  $20000.00 


(1)  PRINCIPAL  INVESTIGATOR:  Dr.  Ian  W.  Hall 

Materials  Science 
University  of  Delaware 
Newark/  DE  19716 


(2)  UNITED  STATES  AF08R  CONTRACT  NUMBER:  P49620-90-C-09076 


(3)  CATALOG  OF  FEDERAL  DOMESTIC  ASSISTANCE  NUMBER  (CFDA) :  12.800 

PROJECT  TITLE:  AIR  FORCE  DEFENSE  RESEARCH  SOURCES  PROGRAM 


(4)  ATTACHMENTS  1  AND  2:  SREP  REPORT  INSTRUCTIONS 


***  SIGN  SREP  SUBCONTRACT  AND  RETURN  TO  RDL  *** 
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1 .  BACKGROUND :  Research  &  Development  Laboratories  (RDL)  is  under  contract  (F49620-90-C- 
0076)  to  the  United  States  Air  Force  to  administer  the  Summer  Research  Programs  (SRP), 
sponsored  by  the  Air  Force  Office  of  Scientific  Research  (AFOSR),  Bolling  Air  Force  Base,  D.C. 
Under  the  SRP,  a  selected  number  of  college  faculty  members  and  graduate  students  spend  part 
of  the  summer  conducting  research  in  Air  Force  laboratories.  After  completion  of  the  summer  tour 
participants  may  submit,  through  their  home  institutions,  proposals  for  follow-on  research.  The 
follow-on  research  is  known  as  the  Research  Initiation  Program  (RIP).,  Approximately  75  RIP 
proposals  annually  will  be  selected  by  the  Air  Force  for  funding  of  up  to  $20,000;  shared  funding 
by  the  academic  institution  is  encouraged.  RIP  efforts  selected  for  funding  are  administered  by 
RDL  through  subcontracts  with  the  institutions.  This  subcontract  represents  such  an  agreement 
between  RDL  and  the  institution  designated  in  Section  5  below. 

2.  RDL  PAYMENTS:  RDL  will  provide  the  following  payments  to  RIP  institutions: 

•  90  percent  of  the  negotiated  RIP  dollar  amount  at  the  start  of  the  RIP  Research  period. 

•  the  remainder  of  the  funds  within  30  days  after  receipt  at  RDL  of  the  acceptable  written  final 
report  for  the  RIP  research. 

3 .  INSTITUTION’S  RESPONSIBILITIES:  As  a  subcontractor  to  RDL,  the  institution  designated  on 
the  title  page  will: 

a.  Assure  that  the  research  performed  and  the  resources  utilized  adhere  to  those  defined  in  the 
RIP  proposal. 

b.  Provide  the  level  and  amounts  of  institutional  support  specified  in  the  RIP  proposal. 

c.  Notify  RDL  as  soon  as  possible,  but  not  later  than  30  days,  of  any  changes  in  3a  or  3b 
above,  or  any  change  to  the  assignment  or  amount  of  participation  of  the  Principal 
Investigator  designated  on  the  title  page. 

d.  Assure  that  the  research  is  completed  and  the  final  report  is  delivered  to  RDL  not  later  than 
twelve  months  from  the  effective  date  of  this  subcontract.  The  effective  date  of  the 
subcontract  is  one  week  after  the  date  that  the  institution’s  contracting  representative  signs 
this  subcontract,  but  no  later  than  January  15,  1992. 

e.  Assure  that  the  final  report  is  submitted  in  the  format  shown  in  Attachment  1. 
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f.  Agree  that  any  release  of  information  relating  to  this  subcontract  (news  releases,  articles, 
manuscripts,  brochures,  advertisements,  still  and  motion  pictures,  speeches,  trade  association 
meetings,  symposia,  etc.)  will  include  a  statement  that  the  project  or  effort  depicted  was  or 
is  sponsored  by:  Air  Force  Office  of  Scientific  Research,  Bolling  AFB,  D.C. 

g.  Notify  RDL  of  inventions  or  patents  claimed  as  the  result  of  this  research  in  a  format 
specified  in  Attachment  1 . 

h.  RDL  is  required  by  the  prime  contract  to  flow  down  patent  rights  and  technical  data 
requirements  in  this  subcontract.  Attachment  2  to  this  subcontract  contains  a  list  of  contract 
clauses  incorporated  by  reference  in  the  prime  contract. 

4.  All  notices  to  RDL  shall  be  addressed  to: 

RDL  Summer  Research  Program  Office 
5800  Uplander  Way 
Culver  City,  CA  90230-6608 

5.  By  their  signatures  below,  the  parties  agree  to  the  provisions  of  this  subcontract. 


Abe  S.  Sopher  Signature  of  Institution  Contracting  Official 

RDL  Contracts  Manager 

Typed/Printed  Name 


Date  Title 


Institution 


Date/Phone 
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Attachment  1 


Final  Report  Format 

1 .  All  RIP  Principal  Investigators  will  submit  a  final  report  of  the  research  conducted. 

2.  One  copy  of  the  report  is  due  to  RDL  no  later  than  twelve  months  after  the  effective  date  of  the 
RIP  subcontract.  At  the  same  time,  submit  one  copy  to  the  Air  Force  laboratory  focal  point. 

3.  The  title  page  should  contain  the  title  of  the  research,  the  Principal  Investigator  and  or  other  co¬ 
investigators,  the  month  and  year  of  issue,  the  university  with  department  and  address,  and 
acknowledgement  of  sponsorship  by  AFOSR  (see  clause  3f  of  this  subcontract). 

4.  For  text,  use  a  font  that  is  12  characters  per  inch  (elite)  and  as  close  to  letter  quality  as  possible. 
Start  with  the  title  in  all  caps  one  and  one-half  inches  from  the  top  of  the  first  page;  if  the  title 
requires  two  or  more  lines,  single  space  it.  Double  space  below  the  title,  and  then  center  and  t5q)e 
the  researcher’s  title  and  name.  Then  space  twice  and  begin  the  double-spaced  text. 

Use  a  one-and-one-half-inch  left  margin  and  a  one-inch  right  margin  for  the  body  of  the  text. 
Center  page  numbers  at  the  foot  of  each  page,  one  inch  from  the  bottom.  Each  page  should  have 
a  one-inch  margin  at  the  top.  The  format  should  be  that  of  a  standard  research  paper:  it  should 
begin  with  a  one-paragraph  abstract  (on  its  own  page)  summarizing  your  work  and  should  be 
followed  by  an  introduction,  a  discussion  of  the  problem,  a  results  section,  and  a  conclusion.  Since 
multiple  copies  of  your  report  may  be  required,  assure  that  all  pages  can  be  readily  copied  to  a 
black-and-white  8  1/2"  by  11 "  page.  (No  colors,  such  as  blue  or  green,  that  don’t  photocopy  well, 
and  no  foldouts,  please.) 

5.  The  report  must  be  accompanied  by  a  separate  statement  on  whether  or  not  any  inventions  or 
patents  have  resulted  from  this  research.  If  yes,  use  a  DD  Form  882  (supplied  by  RDL  on  request) 
to  indicate  the  patent  filing  date,  serial  number,  title,  and  a  copy  of  the  patent  application,  and 
patent  number  and  issue  date  for  any  subject  invention  in  any  country  in  which  the  subcontractor 
has  applied  for  patents. 
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Attachment  2 


Contract  Pauses 


This  contract  incorporates  by  reference  the  following  clauses  of  the  Federal  Acquisition  Regulations 
(FAR),  with  the  same  force  and  effect  as  if  they  were  given  in  full  text.  Upon  request,  the  Contracting 
Officer  or  RDL  will  make  their  full  text  available  (FAR  52.252-2). 


FAR  CLAUSES  TITLE  AND  DATE 

52.202- 1  DEFINITIONS  (APR  1984) 

52.203- 1  OFFICIALS  NOT  TO  BENEFIT  (APR  1984) 


52.203- 3  GRATUITIES  (APR  1984) 

52.203- 5  COVENANT  AGAINST  CONTINGENT  FEES  (APR  1984) 

52.304-6  RESTRICTIONS  ON  SUBCONTRACTOR  SALES  TO  THE  GOVERNMENT 

(JUL  1985) 

52.203- 7  ANTI-KICKBACK  PROCEDURES  (OCT  1988) 

52.203- 12  LIMITATION  ON  PAYMENTS  TO  INFLUENCE  CERTAIN  FEDERAL 

TRANSACTIONS  (JAN  1990) 

52.204- 2  SECURITY  REQUIREMENTS  (APR  1984) 

52.209-6  PROTECTING  THE  GOVERNMENT’S  INTEREST  WHEN 

SUBCONTRACTING  WITH  CONTRACTORS  DEBARRED,  SUSPENDED, 
OR  PROPOSED  FOR  DEBARMENT  (MAY  1989) 

52.212-8  DEFENSE  PRIORITY  AND  ALLOCATION  REQUIREMENTS  (MAY  1986) 

52.215- 1  EXAMINATION  OF  RECORDS  BY  COMPTROLLER  GENERAL  (APR  1984) 

52.215- 2  AUDIT  -  NEGOTIATION  (DEC  1989) 

52.222- 26  EQUAL  OPPORTUNITY  (APR  1984) 

52.222- 28  EQUAL  OPPORTUNITY  PREAWARD  CLEARANCE  OF  SUBCONTRACTS 

(APR  1984) 

52.222- 35  AFFIRMATIVE  ACTION  FOR  SPECIAL  DISABLED  AND  VIETNAM  ERA 

VETERANS  (APR  1984) 

52.222- 36  AFFIRMATIVE  ACTION  FOR  HANDICAPPED  WORKERS  (APR  1984) 
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52.222- 37 

52.223- 2 

52.232- 6 

52.224- 1 

52.224- 2 

52.225- 13 

52.227- 1 

52.227- 2 

52.227- 10 

52.227- 11 

52.228- 6 

52.228- 7 
52.230-5 

52.232- 23 
52.237-3 
52.246-25 

52.249- 6 

52.249- 14 
52.251-1 


EMPLOYMENT  REPORTS  ON  SPECIAL  DISABLED  VETERANS  AND 
VETERANS  OF  THE  VIETNAM  ERA  (JAN  1988) 

CLEAN  AIR  AND  WATER  (APR  1984) 

DRUG-FREE  WORKPLACE  (MAR  1989) 

PRIVACY  ACT  NOTIFICATION  (APR  1984) 

PRIVACY  ACT  (APR  1984) 

RESTRICTIONS  ON  CONTRACTING  WITH  SANCTIONED  PERSONS 
(MAY  1989) 

AUTHORIZATION  AND  CONSENT  (APR  1984) 

NOTICE  AND  ASSISTANCE  REGARDING  PATENT  AND  COPYRIGHT 
INFRINGEMENT  (APR  1984) 

FILING  OF  PATENT  APPLICATIONS  -  CLASSIFIED  SUBJECT  MATTER 
(APR  1984) 

PATENT  RIGHTS  -  RETENTION  BY  THE  CONTRACTOR  (SHORT  FORM) 
(JUN  1989) 

INSURANCE  -  IMMUNITY  FROM  TORT  LIABILITY  (APR  1984) 

INSURANCE  -  LIABILITY  TO  THIRD  PERSONS  (APR  1984) 

DISCLOSURE  AND  CONSISTENCY  OF  COST  ACCOUNTING  PRACTICES 
(SEP  1987) 

ASSIGNMENT  OF  CLAIMS  (JAN  1986) 

CONTINUITY  OF  SERVICES  (APR  1984) 

LIMITATION  OF  LIABILITY  -  SERVICES  (APR  1984) 

TERMINATION  (COST-REIMBURSEMENT)  (MAY  1986) 

EXCUSABLE  DELAYS  (APR  1984) 

GOVERNMENT  SUPPLY  SOURCES  (APR  1984) 
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DoD  FAR  CLAUSES  TITLE  AND  DATE 


252.203- 7001 

252.203- 7002 

252.223-7500 

252.225-7001 

252-225-7023 

252.227- 7013 

252.227- 7018 

252.227- 7029 

252.227- 7034 

252.227- 7037 

252.231- 7000 

252.231- 7001 

252.231- 7003 
252.251-7000 
252.271-7001 


SPECIAL  PROHIBITION  ON  EMPLOYMENT  (MAR  1989) 

STATUTORY  COMPENSATION  PROHIBITIONS  AND  REPORTING 
REQUIREMENTS  RELATING  TO  CERTAIN  FORMER  DEPARTMENT  OF 
DEFENSE  (DoD)  EMPLOYEES  (APR  1988) 

DRUG-FREE  WORK  FORCE  (SEP  1988) 

BUY  AMERICAN  ACT  AND  BALANCE  OF  PAYMENTS  PROGRAM  (APR 
1985) 

RESTRICTION  ON  ACQUISITION  OF  FOREIGN  MACHINE  TOOLS  (JAN 
1989) 

RIGHTS  IN  TECHNICAL  DATA  AND  COMPUTER  SOFTWARE  (OCT 
1988) 

RESTRICTIVE  MARKINGS  ON  TECHNICAL  DATA  (OCT  1988) 
IDENTIFICATION  OF  TECHNICAL  DATA  (APR  1988) 

PATENTS  -  SUBCONTRACTS  (APR  1984) 

VALIDATION  OF  RESTRICTIVE  MARKINGS  ON  TECHNICAL  DATA 
(APR  1988) 

SUPPLEMENTAL  COST  PRINCIPLES  (APR  1984) 

PENALTIES  FOR  UNALLOWABLE  COSTS  (APR  1988) 

CERTIFICATION  OF  INDIRECT  COSTS  (APR  1986) 

ORDERING  FROM  GOVERNMENT  SUPPLY  SOURCES  (APR  1984) 

RECOVERY  OF  NONRECURRING  COSTS  ON  COMMERCIAL  SALES  OF 
DEFENSE  PRODUCTS  AND  TECHNOLOGY  AND  OF  ROYALTY  FEES 
FOR  USE  OF  DoD  TECHNICAL  DATA  (FEB  1989) 
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7  November  1991 


AFOSR/PKO 

Bldg.  410,  Room  C-124 

Bolling  AFB,  DC  20332-6448 

Attn:  Ms.  Kathleen  Wetherell 

Dear  Ms.  Wetherell: 

Enclosed  for  your  approval  is  the  model  subcontract  for  the  Research  Initiation  Program  under  the 
Summer  Research  Programs  (Contract  F9620-90-C-0076).  The  blanks  will  be  filled  by  merging 
information  from  our  dBase  IV  database. 

Sincerely, 

Abe  S.  Sopher 
Contracts  Manager 

cc:  AFOSR/NI  (Lt.  Col.  Cavendar) 
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ATTACHMENT  2: 


SAMPLE  TECHNICAL  EVALUATION  FORM  AND  TECHNICAL 
EVALUATION  SUMMARY 


1992  RESEARCH  INITIATION  PROGRAM  TECHNICAL  EVALUATION 


RIP  NO:  92-2 

RIP  ASSOCIATE:  Dr.  Charles  Kibert 


Provided  are  several  evaluation  statements  followed  by  ratings  of  (1) 
through  (5).  A  rating  of  (1)  is  the  lowest  and  (5)  is  the  highest. 
Circle  the  rating  level  number  you  best  feel  rates  the  statement. 
Document  additional  comments  on  the  back  of  this  evaluation  form. 

Mail  or  fax  the  completed  form  to: 

RDL 

Attn:  1992  RIP  TECH  EVALS 
5800  Uplander  Way 
Culver  City,  CA  90230-6608 
(Fax:  310  216-5940) 


1.  This  RIP  report  has  a  high  level  of  technical  merit  12345 

2.  The  RIP  program  is  important  to  accomplishing  the  lab's  12345 

mission 


3.  This  RIP  report  accomplished  what  the  associate's  proposal 
promised 

4.  This  RIP  report  addresses  area(s)  important  to  the  USAF 

5.  The  USAF  should  continue  to  pursue  the  research  in  this 

RIP  report 

6.  The  USAF  should  maintain  research  relationships  with  this 
RIP  associate 

7.  The  money  spent  on  this  RIP  effort  was  well  worth  it 

8.  This  RIP  report  is  well  organized  and  well  written 

9.  I'll  be  eager  to  be  a  focal  point  for  summer  and  RIP 
associates  in  the  future 

10.  The  one-year  period  for  complete  RIP  research  is  about 
right 


1  2  3  4  5 

1  2  3  4  5 

1  2  3  4  5 

1  2  3  4  5 

1  2  3  4  5 

1  2  3  4  5 

1  2  3  4  5 

1  2  3  4  5 


****n8E  THE  BACK  07  THIS  FORM  FOR  ADDITIONAL  COMMENTS**** 


LAB  FOCAL  POINT'S  NAME  (PRINT):  _ 

OFFICE  SYMBOL:  _  PHONE: 
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TECHNICAL  EVALUATION  SUMMARY 

Technical  Evaluation  Questionnaire  Rating  Factors 
Subcontract  no.  1  23A5678910  Average 

135  5  4  5  4  4  4  4  4  5  5  4.4 

50  4  4  5  4  4  4  4  3  5  5  4.2 

3  4333333334  3.2 

71  4  4  4  4  3  5  5  4  5  5  4.3 

152  3  4  3  4  4  3  4  3  4  5  3.7 

147  5  5  5  5  5  5  5  5  5  4  4.9 

oil  4  4  5  4  5  5  5  4  5  4  4.5 

106  5  5  4  5  5  5  5  5  5  5  4.9 

83  5  4  5  5  5  5  5  5  5  4  4.8 

26  5  4  4  5  5  5  5  5  4  4  4.6 

8  5344555355  4.4 

120  1  5  2  4  5  3  2  1  4  4  3.1 

133  3  2  4  5  5  4  3  4  3  5  3.8 

108  5  4  4  5  5  5  5  5  5  5  4.8 

76  5  5  5  5  5  5  5  5  5  3  4.8 

122  5  5  5  5  5  4  5  5  5  5  4.9 

92  4  5  5  5  5  5  5  5  5  5  4.9 

47  5  5  5  5  5  4  4  5  5  5  4.8 

57  4  4  4  5  5  4  4  4  4  2  4.0 

17  5  5  5  5  5  5  5  5  5  5  5.0 

5  5344455543  4.2 

62  5  4  5  4  4  5  5  5  5  5  4.7 

74  4  3  4  4  4  4  5  4  4  5  4.1 

53  4344343534  3.7 

84  5  4  4  5  5  5  5  5  5  4  4.7 

145  4  4  5  4  5  5  5  5  5  4  4.6 

35  5  5  5  5  5  5  5  5  5  5  5.0 
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Technical  Evaluation  Questionnaire  Rating  Factors 
1  23456789  10  Average 
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PARALLELISM  IN  USER  ASSISTED  INFORMATION  EXTRACTION 


Pradip  Peter  Dey 
Associate  Professor 
Computer  Science  Department 
Hampton  University 


ABSTRACT 

Since  information  extraction  from  formatted  and  unformatted  messages  cannot  be  fully 
automated  with  current  technology,  user  assisted  information  exuaction  is  considered  a  reasonable 
strategy  for  military  message  processing.  User  assisted  information  extraction  systems  are  good 
candidates  for  parallel  execution;  because  (1)  these  are  highly  complex  systems  that  run  slowly  on 
sequential  computers,  (2)  these  systems  must  respond  extremely  fast,  (3)  these  systems  exist  in  a 
multicomputer  (distributed)  environment  where  proper  utilization  of  resources  of  several  computers 
is  a  reasonable  strategy.  This  report  presents  results  of  a  systematic  study  of  parallel  processing 
architectures,  algorithms,  data  and  control  structures  for  a  user  assisted  information  extraction 
system  called  Generic  Intelligence  Processor  which  is  being  developed  in  evolutionary  stages  by 
Rome  Laboratory. 
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PARALLELISM  IN  USER  ASSISTED  INFORMATION  EXTRACTION 


Pradip  Peter  Dey 


1.  INTRODUCTION 

Certain  military  applications  require  rapid,  precise  and  error-free  information  extraction  (from 
formatted  and  unformatted  messages)  which  can  be  delivered  neither  by  fully  automated  systems  nor 
by  fully  manual  systems.  User  assisted  information  extraction  is  considered  a  reasonable  strategy  for 
these  applications.  User  assisted  information  extraction  systems  are  good  candidates  for  parallel 
execution;  because  (1)  these  are  highly  complex  systems  that  run  slowly  on  sequential  computers, 
(2)  these  systems  must  respond  extremely  fast,  (3)  these  systems  exist  in  a  multicomputer 
(distributed)  environment  where  proper  utilization  of  resources  of  several  computers  is  a  reasonable 
strategy. 

This  report  presents  a  systematic  study  of  parallel  processing  architectures,  algorithms,  data  and 
control  structures  for  a  user  assisted  information  extraction  system  called  Generic  Intelligence 
Processor  (GIP)  that  has  been  procured  by  the  IRDS  branch  of  Rome  Laboratory. 

One  of  the  goals  of  IRDS/RL  has  been  to  develop  and  test  GIP  in  the  real  world  situation.  For 
this  purpose  an  enhanced  version  of  GIP  is  under  development  now.  In  addition,  further  long-term 
enhancements  of  GIP  are  being  planned.  With  GIP,  IRDS  has  undertaken  a  challenging  task  of 
putting  an  emerging  technology  into  work.  GIP  is  proposed  to  be  developed  in  evolutionary  stages 
gradually  adding  new  components  and  enhancing  the  system  as  the  technology  develops.  GIP  has 
enormous  potential  for  technology  transfer  in  some  crucial  areas  as  it  provides  an  environment  for 
transferring  both  short  and  long-term  developments.  This  proposed  research  focuses  on  long-term 
enhancement  of  GIP  rather  than  immediate  enhancement  that  is  already  undertaken.  The  specific 
long-term  enhancement  that  is  to  be  addressed  here  is  the  parallelization  problem  of  GIP. 


2.  GENERIC  INTELLIGENCE  PROCESSOR  (GIP) 

A  detailed  description  of  GIP  is  available  in  a  technical  proposal  written  by  the  Knowledge 
Systems  Concepts,  Inc.  (KSC  1991).  The  main  goal  of  GIP  is  to  extract  information  from  formatted 
and  unformatted  messages  and  transmit  the  results  to  downstream  processes  such  as  expert  systems. 
GIP  is  a  toolkit  for  developing,  testing,  and  delivering  message  processing  applications.  When  fully 
developed  GIP  will  provide  the  following  capabilities  (a)  user  selectable  input  sources,  (b)  automatic 
message  parsing,  (c)  automatic/user  assisted  information  extraction,  (d)  automatic  output  formatting 
for  development  systems,  (e)  utilities  for  developing  new  algorithms.  For  processing  messages  with 
substantial  natural  language  texts  user  assistance  will  be  crucial  for  GIP,  because,  natural  language 
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processing  technology  is  not  mature  enough  for  fully  automated  information  extraction  in  an 
operational  environment.  Therefore,  there  is  considerable  emphasis  on  user  interface  in  the  GIF 
design.  The  GIF  program  components  are  reproduced  in  Figure- 1. 

As  mentioned  earlier,  GIF  is  a  complicated  system  requiring  enormous  computing  power.  Efficient 
execution  of  GIF  is  possible  only  by  clever  utilization  of  computing  resources  in  the  GIF 
environment. 

3.  FARALLEL  ARCHITECTURES  FOR  GIF 

There  are  three  basic  architecture  types:  (1)  SISD  =  single  instruction  single  data  stream. 
Traditional  sequential  computers  are  of  this  type.  A  single  instruction  is  executed  at  a  time  on  a 
single  piece  of  data.  (2)  SIMD  =  single  instruction  multiple  data  stream.  A  single  instruction  is 
executed  in  a  number  of  processors  on  different  pieces  of  data.  Synchronization  is  automatic  in  this 
architecture.  A  processor  either  executes  the  same  instruction  with  other  processors  or  remains  idle. 
The  connection  machine  is  an  SIMD  machine  (Hillis  1985).  (3)  MIMD  =  multiple  instmction 
multiple  data  stream.  Different  instructions  can  be  executed  in  different  processors  on  different 
pieces  of  data.  There  are  two  types  of  MIMD  computers:  (a)  Multicomputers  =  loosely  coupled 
systems.  In  this  architecture  each  processor  has  its  own  local  memory  and  communications  between 
the  processors  are  achieved  through  an  interconnection  network.  They  are  also  known  as  distributed 
systems.  The  n-cube  computers  are  of  this  type  (Athas  and  Seitz  1988). 

(b)  Multiprocessors  =  tightly  coupled  systems.  In  these  systems,  the  memory  is  common  to  all 
processors.  The  processors  share  a  number  of  global  variables  stored  in  the  common  memory. 
These  variables  enable  the  processors  to  communicate  efficiently  through  the  shared  memory  if  the 
number  of  processors  is  not  too  large.  Two  examples  of  commercially  available  shared  memory 
multiprocessors  are  the  BBN  Butterfly  system  (Rettberg  and  Thomas  1986)  and  the  Sequent  Balance 
system  (Osterhaug  1985).  Algorithms  designed  for  multiprocessors  are  known  as  asynchronous 
parallel  algorithms.  An  asynchronous  algorithm  is  a  collection  of  processes,  some  or  all  of  which 
are  executed  simultaneously  on  a  number  of  available  processors.  When  the  parallel  algorithm 
begins  execution  on  a  processor,  it  creates  a  number  of  processes  to  be  performed.  If  free 
processors  are  available,  these  processes  are  assigned  to  the  processors  to  perform  the  necessary 
computations.  Otherwise,  a  process  is  queued  and  waits  for  a  processor  to  be  free.  When  a 
processor  completes  execution  of  a  process,  it  becomes  free. 
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PROCESS  QUEUE 


[]  D  []  []  []  D  []  [] 

PROCESSOR  QUEUE 

Figure  2.  Two  Main  Queues  of  a  Multiprocessor 


Multiprocessors  are  very  popular  because  they  are  easy  to  program.  Multiprocessors  work  with 
two  queues:  a  process  queue  and  a  processor  queue.  The  main  thing  a  programmer  has  to  do  is 
create  an  optimal  number  of  processes  out  of  a  program.  The  processes  are  queued  in  the  process 
queue.  Free  processors  wait  in  the  processor  queue.  If  processes  are  waiting  in  the  process  queue 
then  they  are  assigned  to  available  processors  from  the  processor  queue.  After  a  process  is  executed, 
the  processor  becomes  free  and  is  queued  back  to  the  processor  queue. 

Besides  these  basic  architecture  types  described  above  there  are  reconfigurable  architectures 
that  can  be  conveniently  configured  as  shared  memory  or  distributed  memory  according  to 
computational  requirements  of  specific  applications. 

There  is  another  interesting  but  specialized  parallel  architecture  of  computation  known  as  the 
neural  network  in  which  a  large  number  of  processors  (similar  to  neurons)  are  interconnected  and 
processing  is  done  by  spreading  activation  (McClelland  et  al  1986).  By  passing  values  processors 
change  the  connective  strengths  among  themselves  and  propagate  activations  through  the  network. 
This  propagation  constitutes  processing  by  the  system.  The  neural  network  is  an  attractive  model  of 
parallel  computation  for  building  trainable  systems  (Howells  1988). 

When  fully  developed  GIF  would  be  a  very  large  and  complex  system  that  would  probably 
require  a  hybrid  architecture  for  efficient  execution  of  its  varied  components.  For  example,  neural 
network  architecture  may  be  needed  for  some  of  the  learning  components  of  GIF;  whereas  a 
multicomputer  architecture  could  be  used  for  the  natural  language  component.  The  shared  memory 
architecture  would  not  suitable  for  GIF  for  the  communication  bottleneck  the  architecture  creates 
when  the  number  of  processors  is  increased.  GIF  requires  a  scalable  architecture  because  the 
message  processing  load  of  GIF  would  vary  from  time  to  time.  A  cost  effective  parallel  architecture 
for  GIF  is  a  network  of  work  stations  with  client  server  computing  capabilities.  A  crucial  problem  of 
this  architecture  is  the  difficulty  in  parallelization  of  software.  An  experimental  parallel  execution 
module  has  been  developed  in  this  architecture  using  a  network  of  three  sun  workstations. 
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4.  PARALLEL  EXECUTION  MODULE 


A  parallel  execution  module  has  been  developed  that  runs  on  a  network  of  three  Sun 
workstations.  It  allows  execution  of  any  three  precompiled  programs  on  three  Sun  workstations  in 
parallel.  When  a  user  invokes  the  parallel  execution  module,  it  creates  three  distinct  processes  and 
executes  them  on  3  different  workstations  and  obtain  the  results  back  to  the  client  machine.  The 
user  has  the  freedom  to  choose  the  programs  to  be  executed  in  parallel.  The  remote  execution 
module  is  implemented  using  rep  and  rsh  protocols  of  the  Sun  Microsystems.  This  module  has  been 
delivered  and  demonstrated  to  Rome  Laboratory  in  August  1992.  This  module  produces  large  grain 
parallelism  at  a  high  level.  Distributed  execution  of  large  GIP  components  by  this  module  is  highly 
desirable  for  adequate  speedup.  It  takes  about  4  seconds  for  a  process  to  migrate  to  a  remote  server 
site  with  its  data.  Therefore,  there  would  not  be  substantial  speedup  by  parallel  execution  of  small 
GIP  modules.  The  level- 1  data  flow  diagram  (DFD)  of  the  parallel  execution  module  is  presented  in 
Figure  3. 


Figure  3.  Data  Flow  Diagram  of  the  Parallel  Execution  Module 
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5.  DATA  STRUCTURES 


Input  to  GLP  is  a  sequence  of  characters  given  from  text  files.  After  a  preliminary  error  checking 
process  the  data  is  broken  into  tokens.  Structural  and  semantic  analyses  are  then  performed  for 
information  extraction.  The  extracted  information  is  then  arranged  in  attributes  and  values.  The 
output  information  is  formatted  according  to  the  down  stream  processes.  The  parallel  execution 
module  can  adequately  deal  with  the  data  structures  required  by  GBP. 

6.  NATURAL  LANGUAGE  PROCESSING  IN  THE  GIP  ENVIRONMENT 

Although  Natural  Language  Processing  (NLP)  technology  is  not  mature  enough  for  fully 
automated  systems,  it  is  mature  enough  to  be  put  in  certain  real  world  use  with  appropriate  user 
assistance.  Information  extraction  from  messages  is  one  of  the  most  promising  fields  for  this 
technology.  User  assistance  would  be  required  in  the  extraction  process  until  some  future  break¬ 
through  makes  totally  automated  natural  language  understanding  possible.  GBP  requires  robust, 
user-oriented,  modifiable,  parallelizable  NLP  modules.  GIP  is  recommended  to  have  at  least  two 
alternative  NLP  modules  so  that  users  would  be  able  to  compare  the  results  of  the  two  NLP 
modules.  These  two  NLP  modules  should  be  run  in  parallel  for  obtaining  efficient  response  time  so 
that  user  assistance  can  be  planned  within  reasonable  time  limit. 

Robustness  and  Scalability  should  be  among  the  major  goals  of  the  NLP  modules.  NLP  systems 
require  large  knowledge  bases  for  achieving  robustness.  Some  learning  methods  should  be  used  to 
partially  automate  the  system  development  process.  Learning  methods  should  contribute  to 
scalability  by  making  it  feasible  to  develop  the  knowledge  bases  efficiently.  Learning  methods 
should  also  contribute  to  enhancibility  of  the  system.  The  requirements  of  the  NLP  modules  of  GIP 
can  be  briefly  specified  as  follows; 

(1)  The  GIP  NLP  modules  should  have  partially  automated  knowledge  acquisition  methods. 
That  means  some  learning  algorithms  should  be  used  for  acquiring  domain  knowledge.  GIP 
would  then  be  able  to  deal  with  the  diversity  of  domains. 

(2)  GIP  should  use  some  statistical  methods  to  guide  processing.  Statistical  methods  can  be 
successfully  used  in  dealing  with  ambiguity. 

(3)  GIP  would  require  well-documented  NLP  modules.  In  order  to  integrate  the  NLP 
modules,  documents  like  algorithm  descriptions,  software  design  documents  and  user’s 
manual  would  be  necessary.  Software  development  models  used  for  the  modules  should  be 
briefly  described. 

(4)  The  NLP  component  should  be  properly  modularized.  The  interfaces  between  modules 
should  be  well-defined. 

(5)  The  NLP  modules  should  allow  parallelization  for  efficient  execution. 

(6)  Scalability  should  be  considered  as  an  essential  feature  of  the  NLP  modules.  It  is  often 
required  to  incorporate  this  provision  in  the  design  level;  otherwise,  the  modules  have  to  be 
redesigned. 

(7)  The  NLP  modules  should  be  modifiable,  otherwise  GIP  will  not  be  able  to  deal  with  the 
emerging  technology. 
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Abstract 

An  efficient  noise  modeling  method  for  millimeter-wave  field  effect 
transistors  is  described.  The  method  differs  from  conventional  techniques  by 
avoiding  complications  associated  with  the  use  of  a  variable  impedance/admittance 
tuner  at  the  input  of  the  FET.  Noise  figure  is  measured  at  one  known  source 
admittance  state  over  several  frequencies.  A  knowledge  of  the  FET  equivalent 
circuit  model  is  assumed,  and  an  extraction  of  the  model  parameter  values  is 
demonstrated  on  both  a  low  noise  MESFET  and  PHEMT.  Noise  correlation  matrix 
techniques  are  applied  to  express  the  noise  figure  equation  in  a  generalized 
format  that  allows  various  intrinsic  FET  noise  source  representations.  Noise 
parameter  data  is  measured  at  microwave  frequencies  using  an  established  tuner 
based  system  for  model  comparisons.  The  sensitivity  of  the  noise  modeling  method 
and  measurements  are  addressed. 
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I .  Executive  Summary 


New  noise  modeling  research  is  presented  to  eliminate  expensive  hand  tuning 
and  improve  the  effectiveness  of  computer-aided-design  of  millimeter-wave  (mm- 
wave)  FET  low  noise  amplifiers.  This  research  is  very  relevant  to  the  goals  of 
the  Air  Force  and  of  Rome  Laboratory  towards  the  advancement  of  high  performance 
mm-wave  command  and  control  communications  and  intelligence  (C^I)  systems.  These 
systems  should  benefit  substantially  from  the  inclusion  of  low  noise  amplifiers 
(LNA's)  that  employ  state-of-the-art  mm-wave  FETs  (including  MESFETs  and  HEMTs), 
and  monolithic  microwave/mm-wave  integrated  circuit  (MMIC)  technology.  However, 
these  potential  benefits  cannot  be  realized  until  accurate  mm-wave  FET  noise 
characterization  becomes  possible.  Accurate  mm-wave  FET  noise  characterization 
is  not  currently  possible  with  conventional  techniques  used  at  microwave 
frequencies  (i.e.  below  30GHz). 


The  research  completed  here  supports  on-going  in-house  Air  Force  research 
in  the  Component  Technology  Branch  (ERAC)  at  Rome  Laboratory.  The  modeling 
technique  complements,  and  may  eventually  replace,  the  difficult  experimental 
noise  characterization  of  mm-wave  Indium  Phosphide  High  Electron  Mobility 
Transistors  (HEMTs).  These  HEMTs  are  currently  being  used  by  RL/ERAC  in  60GHz 
monolithic  low  noise  amplifiers.  Related  RL/ERAC  research  will  continue  with 
60GHz  monolithic  receive  circuits  and  120GHz  LNAs.  The  noise  modeling  technique 
reported  here  is  expected  to  contribute  to  both  of  these  efforts. 

The  research  advances  a  new  method  for  noise  modeling  of  mm-wave  field 
effect  transistors.  The  new  noise  modeling  method  was  the  subject  of  a  1991 
USAF-AFOSR  Summer  Research  Program.  The  method  avoids  the  complications  of  a 
variable  impedance  tuner,  and  requires  only  the  knowledge  of  a  small  signal 
equivalent  circuit,  and  noise  figures  measured  across  a  range  of  frequencies  for 
a  single  known  source  impedance.  Noise  parameters  derived  from  this  method  have 
demonstrated  promising  results  in  comparison  with  measured  data  taken  at 
microwave  frequencies.  However,  the  method  required  further  enhancement  and 
verification  for  reliable  application  to  mm-wave  noise  modeling  development.  The 
specific  tasks  of  the  1992  research  initiation  program  are  shown  in  Figure  1  and 
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Millimeter-wave  FET  Noise  Modeling  Approach 


Figure  1  Millimeter-wave  noise  modeling  development  plan  for  1992 


were  carried  out  as  five  tasks  as  follows; 

1)  Implement  necessary  improvements  to  the  method,  including  utilization  of  a 
more  general  formulation,  and  a  modified  computer  algorithm- 

2)  Investigate  the  method's  sensitivity  to  measurement  errors,  and  (if 
warranted)  explore  the  use  of  alternate  computational  technigues  for 
improving  the  method's  tolerance  to  measurement  errors. 

3)  Implement  an  improved  small  signal  modeling  method  to  separate  the  intrinsic 
and  extrinsic  FET  equivalent  circuit  parameters  in  a  more  physically 
meaningful  and  consistent  way. 

4)  Perform  microwave  noise  figure  measurements  for  method  verification,  and  to 
examine  the  validity  of  frequency  extrapolation  of  the  derived  noise  model. 

5)  Conducted  a  theoretical  investigation  of  alternative  mm-wave  noise  modeling 
techniques . 

This  research  provides  a  much  needed  mm~wave  FET  noise  characterization  method. 

The  use  of  FET  models  derived  from  this  method  will  reduce  design  costs,  and 

yield  high  performance  MMIC  LNAs  that,  in  turn,  will  improve  the  performance  and 

reduce  the  costs  of  the  Air  Force's  Future  c’l  systems. 
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II.  OBJECTIVES  AND  ACCOMPLISHMENTS 


Problem:  Conventional  microwave  noise  modeling  methods  utilize  a  variable 
impedance  (or  admittance)  tuner,  which  presents  a  sequence  of  "known"  source 
impedances  Z,  (admittances  Y,  )  to  a  device's  input  terminals.  The  tuner  must 
be  able  to  present  a  range  of  impedances  that  represents  a  good  coverage  of  the 
Smith  Chart,  including  near  the  edges  of  the  chart.  Hence,  the  tuner  must  have 
very  low  loss.  The  ability  to  fabricate,  and  characterize,  such  a  tuner  becomes 
increasingly  more  difficult  as  frequency  is  increased.  Research  and  commercial 
developments  are  being  pursued  in  the  industry  and  at  the  Air  Force  towards 
implementing  mm— wave  tuners  for  noise  parameter  measurements.  Because  of  the 
inherent  difficulties  with  this  approach,  however,  it  is  essential  that 
alternatives  be  explored  to  the  reliance  on  a  variable  impedance  tuner  for  mm— 
wave  FET  noise  characterization. 


Research  Objectives;  The  objectives  of  the  performed  work  were  to;  advance  and 
verify  a  new  method  for  FET  noise  model  development  formulated  as  part  of  a 
Summer  research  project'.  Long  range  goals  for  the  anticipated  multi-year 
research  initiated  with  this  project  are  to  investigate  FET  noise  modeling 
methods  applicable  at  mm-wave  frequencies  (i.e.  frequency  >  30GHz) .  Also  to  be 
investigated  are  the  inclusion  of  temperature  effects  in  microwave  and  mm-wave 
noise  models. 

Scope  of  Research:  The  performed  work  comprised  the  first  year  of  a  multi-year 

investigation  of  microwave  and  mm-wave  device  noise  characterization. 
Advancement  of  the  new  FET  noise  modeling  method  discussed  above  was  carried  out 
at  the  University  of  South  Florida.  The  work  involved  computer  program 
development  ,  FET  measurements,  CAD  program  use,  a  theoretical  investigation,  and 
numerical  experiments. 

Specific  Objectives  vs.  Accomplishments:  A  summary  of  the  tasks  performed  and 

the  objectives  that  were  accomplished  are  given  as  follows: 

1)  Objective  -  Advance  and  refine  the  new  noise  modeling  method. 

Accomplishment  -  A  direct  solution  for  the  noise  parameters  was  derived  using 
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noise  correlation  matrix  methods  replacing  the  synthesized  tuner  measurement  of 
the  previous  algorithm  (See  Section  III.  A.).  In  addition,  a  generalized  noise 
modeling  method  was  developed  using  the  correlation  matrix  methods  learned  by 
implementing  the  direct  noise  parameter  solution  (See  Section  III.  B. ) . 

2)  Objective  -  Investigate  method  sensitivity  to  errors. 

Accomplishment  -  Variations  of  the  noise  coefficients  P,  R,  and  C  were  studied 
with  respect  to  sets  of  statistically  perturbed  sets  of  input  data  (noise  figure 
and  reflection  coefficient).  See  Section  V. 

3)  Objective  -  Implement  an  improved  small  signal  modeling  method. 
Accomplishment  -  After  a  review  of  FET  small  signal  model  extraction  techniques 
it  was  concluded  that  the  extraction  of  the  device  parasitic  components  was  most 
important  to  improving  existing  extraction  techniques.  Therefore,  a  paper  study 
was  conducted  with  corresponding  measurements  for  the  extraction  of  MESFET  and 
HEMT  extrinsic  parameter  element  values.  Subsequently,  a  procedure  was 
documented,  where  small  signal  measurements  were  made  and  model  element  values 
extracted  for  both  a  low-noise  MESFET  and  a  PHEMT  (See  Section  IV).  Comparisons 
were  made  to  a  conventionally  derived  "brute  force  optimized"  model.  Microwave 
noise  figure  and  source  reflection  coefficient  measurements  were  performed  at 
U.S.F.  and  used  to  compare  the  resulting  P,  R,  and  C  values  using  both  small 
signal  models. 

4 )  Method  verification  with  microwave  noise  measurements. 

Accomplishments  -  An  automatic  noise  parameter  test  set  (ATN's  NP5a)  was 
assembled  and  configured  as  part  of  this  project  for  operation  with  on-wafer 
microwave  S-parameter  measurement  capabilities  at  U.S.F.  Two  devices  were 
characterized  using  the  ATN  system  over  a  2.0  to  18.0GHz  range.  The  noise 
parameters  of  a  MESFET  and  a  PHEMT  were  used  for  noise  modeling  purposes. 
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III.  Advancement /Ref inement  of  the  Noise  Modeling  Method 


The  development  of  a  noise  modeling  algorithm  that  only  requires  knowledge 
of  the  FET  equivalent  circuit  parameters  and  the  measurement  of  noise  figure  over 
several  frequencies  at  one  known  source  impedance  has  been  described'.  This 
technique  can  significantly  simplify  the  determination  of  device  noise  parameters 
at  millimeter-wave  frequencies,  because  the  technique  is  not  reliant  upon  a 
variable  source  impedance  tuner.  The  previously  developed  algorithm  is  shown 

in  Figure  2 . 

The  previous  approach  proceeds  with  a  determination  of  the  noise 
coefficients  P,  R,  and  C,  through  a  least  squares  fit  of  data  to  a  linearized 
equation  for  the  noise  figure.  Then  the  device  noise  parameters  are  solved  for 
by  numerically  synthesizing  a  tuner  based  measurement  data  set  and  calculating 
the  noise  parameters  using  a  standard  technique^.  This  artificial  measurement 
was  implemented,  because  a  direct  solution  for  the  noise  parameters  was  not 
obvious.  This  is  due  to  the  additional  algebraic  complexity  that  arises  when  the 
circuit  elements  and  Cj,  are  incorporated  in  the  FET  model. 

In  this  report  a  direct  solution  is  obtained  for  the  noise  parameters.  The 
solution  involves  a  matrix  based  representation  of  the  equivalent  circuit  of 
Figure  3,  which  applies  to  various  topologies  and  noise  source  representations. 
The  noise  correlation  matrix  method'*  is  used  to  develop  a  direct  solution  for  the 
noise  parameters,  that  simplifies  the  algorithm  of  Figure  2. 


A  Direct  Solution  for  the  Noise  Parameters 


The  parameters  P,  R,  and  C  in  the  FET  noise  model  of  Figure  3.  are 
necessary  to  describe  the  intrinsic  gate  and  drain  noise  sources  i^  and  i^ .  The 
noise  correlation  matrix,  previously  described'',  allows  for  Z,  Y,  and  ABCD  matrix 
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algorithm  resulting  from  this  work  (right). 
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Figure  3  Field  effect  transistor  small  signal  and  noise  model  used  for  this  work. 


representations  of  the  noise  power  spectral  densities  related  to  the  noise 
sources  and  their  correlation.  Consider  a  set  of  noise  matrices  associated  with 
the  interconnection  of  two-ports  shown  in  Figure  4.  These  noise  matrices  can  be 
transformed  between  the  various  representations  with  the  appropriate  transform 
matrix,  and  a  knowledge  of  the  circuit  elements  through  which  the  transform  is 
applied.  Therefore,  with  P,  R,  and  C  determined  by  the  single  tuner  state 
algorithm,  a  synthesis  of  the  various  two-ports  that  comprise  the  equivalent 
circuit  model  defines  the  noise  correlation  matrix  of  the  total  extrinsic  FET 
model . 

The  total  extrinsic  noise  correlation  matrix  is  developed  by  first 
describing  the  intrinsic  FET,  and  then  interconnecting  the  parasitic  networks 
according  to  Figure  4.  This  is  done  by  determining  the  coefficients  P,  R,  and 
C  associated  with  the  intrinsic  network.  The  intrinsic  FET  correlation  matrix 
is  then  represented  in  a  Y  parameter  form  as: 


Sm 


-jo>C^,C^/RP  g„P 


(1) 


The  analysis  for  the  interconnection  of  noisy  two-ports  parallels  that  of  linear 
network  theory.  The  details  of  converting  and  connecting  noise  matrices  is 
addressed  by  Hillebrand  and  Russer'*.  This  interconnection  of  noisy  two-ports 
proceeds  as  follows: 

1.  Form  the  series  connection  of  the  intrinsic  FET  network  and  the  source 
parasitic  network  to  develop  the  noise  matrix  Czi,  and  the  related 
network  matrix  . 

2.  Transform  the  Z-parameter  matrices  of  step  1.  into  ABCD-parameter 
forms . 

3.  Form  the  ABCD  noise  and  network  matrices  for  the  gate  and  drain 
parasitic  networks. 

4.  Cascade  the  interconnected  linear  two-port  ABCD  matrices  to  form  the 
complete  model  network  matrix  and  noise  correlation  matrix  C^. 

Certain  network  parameter  representations  may  not  exist  in  general  for  all 
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Figure  4  The  small  signal  equivalent  circuit  element  model  (top)  is  redrawn 
as  an  interconnection  of  two-ports  (bottom) .  Each  two-port  has  a 
related  noise  correlation  matrix  and  a  network  parameter  matrix. 
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elements.  For  example,  the  series  Z-parameter  representation  of  the  gate  and 
drain  parasitics  does  not  exist.  Therefore,  the  computer  algorithm  reflects  the 
basic  transforms  required  to  achieve  the  final  ABCD  or  chain  matrix 
representation.  The  noise  correlation  matrices  given  here  have  been  normalized 
by  a  factor  of  2kT  to  simplify  the  expressions,  this  term  ultimately  factors  out 
of  the  final  noise  parameter  equations.  The  resulting  chain  matrix,  C^,  is  given 
in  terms  of  the  FET  noise  parameters  by'*: 


R 


F  .  -  I 

mm  _  1?  V 

- T -  ^s,opl 


-  1 


-  R  Y 

^  s,opt 


KKJi 


(2) 


The  noise  parameters  may  now  be  solved  for  directly  with  the  following  results  •  : 


^mm  ^  ^AM  ^s.oplj 


2 

c 

^A,2l 

c 

j  ^A,\2 

c 

T  '-'-4,12 

> 

c 

■‘m  ^ 

.  - 

m  ^ 
'^AM 

It  is  notable  that,  for  the  field  effect  transistor  model  commonly  used  at 
microwave  and  millimeter-wave  frequencies,  the  matrix  forms  a  standard  noise 
model,  through  which  various  noise  source  representations  can  be  compared.  That 
is,  a  particular  intrinsic  noise  source  representation  can  then  be  implemented 
within  this  model,  provided  that  these  sources  are  known.  The  following  section 
extends  the  use  of  the  correlation  matrix  technique  and  the  formulation  of  the 
single  source  state  noise  modeling  method  by  generalizing  the  noise  figure 
equation  from  which  the  intrinsic  sources  are  described. 
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A  Generalized  Noise  Figure  Equation 


Essentially,  the  noise  modeling  method  presented  above,  has  two  input  data 
sets;  l)the  small  signal  model  parameters  and  2) the  noise  coefficients  P,  R,  and 
C.  These  coefficients  are  calculated  from  a  linearized  form  of  the  noise  figure 
equation,  which  was  derived  from  a  nodal  circuit  analysis'’^.  In  this  work  an 
equation  for  device  noise  figure  is  derived  using  noise  correlation  matrices. 


The  noise  figure  of  an  active  two-port  is  described  in  terms  of  its  noise 
parameters  by 


R 

F  =  F .  +4 

mm  2 


r.  ■ 

- 1 

1 

opt 

2 

I1 1] 

1 

(i  -Hi 

(4) 


The  noise  figure  associated  with  a  particular  source  tuner  state,  F,,  is  the  most 
directly  measurable  parameter  in  (4).  Automated  systems  make  use  of  redundant 
sets  of  noise  figure  data  for  the  analysis  of  device  noise  parameters^’*, 
avoiding  the  alternative  search  for  ?„„„  and  F,^’.  Again,  the  method  described 
here  requires  a  small  signal  equivalent  circuit  model  and  a  measurement  of  noise 
figure  over  several  frequencies  at  only  one  source  admittance.  The  intrinsic  FET 
noise  model  is  characterized  by  the  equivalent  gate  and  drain  noise  current 
sources  i^  and  ij  respectively*.  These  intrinsic  noise  sources  are  in  general 
defined  by  the  user.  These  are  the  intrinsic  sources  that  were  described  by  P, 
R,  and  C  in  the  noise  correlation  matrix  of  the  direct  solution.  The  noise 
figure  is  written  is  terms  of  the  total  FET  noise  correlation  matrix  as. 


F  = 


Z*  [Cj  Z 

1  + _ LiJ— 


(5) 


where  the  intrinsic  FET  correlation  matrix  is  embedded  within  the  extrinsic 
equivalent  circuit  model  matrix  Z.  Therefore,  the  FET  modeling  may  in  general 
be  implemented  with  a  two  parameter .noise  model  using  noise  temperatures  T^  and 
T/  or  a  four  parameter  model  with  sources  e^  and  ij  having  complex  correlation*. 
For  the  present  sources,  i^  and  i^,  a  fit  is  made  to  the  measured  noise  figure  by 
the  parameters  P,  R,  and  c'’^  The  parameter  C  is  complex,  but  assumed  only 
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imaginary  for  this  analysis*. 

The  noise  figure  equation  of  a  pair  of  correlated  noise  sources  with 
spectral  intensities  |  i]  |  ,  \il\  ,  \  ij'  \  and  |  i^ij  \  is  in  general  a  quadratic  form 

that  for  the  case  of  i^  and  i<,  may  be  represented  in  terms  of  the  noise 
coefficients  P,  R,  and  C  as 

F  =  M,  +  R  +  P  +  C 

By  identifying  a  noise  correlation  matrix  for  the  intrinsic  FET  model  and  the 
embedding  parasitics,  a  generalized  algorithm  is  readily  developed  for  FET  noise 
modeling.  Correlation  matrices  provide  a  direct  means  of  determining  the  "M" 
coefficients  of  equation  (6)  for  various  small  signal  equivalent  circuit  model 
topologies.  The  following  analysis  will  reformulate  the  coefficients  M, /  M2,  M3, 
and  M4  in  the  noise  figure  equation  through  application  of  the  correlation  matrix 
which  represents  the  complete  extrinsic  FET  noise  correlation  matrix.  The 
noise  figure  equation  in  terms  of  is  given  by  (5)  where 


Now,  in  equation  (7)  the  noise  correlation  matrix  of  the  intrinsic  FET,  Czi, 

^  ^  z-  (r  .  BCJ')  Z  (7) 

results  from  the  correlated  gate  and  drain  noise  current  sources  of  the  small 
signal  FET  model.  This  intrinsic  noise  correlation  matrix  was  previously  written 
in  equation  (1),  in  admittance  form,  for  a  three  parameter  model  with  noise 
coefficients  P,  R,  and  C. 
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The  B  elements  of  (7)  are  noise  transformation  matrices  for  a  Z-matrix  to  chain 

matrix  transform.  For  the  FET  model  all  the  components  of  (2)  are  2x2  matrices 
and,  the  contribution  of  the  extrinsic  parasitic  networks  are  elements  of  the 

matrix  T.  Combining  (5)  and  (7),  the  coefficient  M,  is  written  in  terms  of  the 
matrix  T  ; 


M,  =  1  + 


Z*  TZ 


Before  solving  for  the  coefficients  Mi,  M3,  M4,  by  expanding  the  intrinsic  term 
B  C  ■  B  ,  let  C  represent  this  intrinsic  term.  From  equation  (7)  comes  the 


quadratic  form: 


C  -  Cj,  +  +  ^n^gen  ^22^gen^sen 

which,  is  used  to  relate  the  expansion  of  B  B  to  the  M2, 3, 4  coefficients. 
Using  the  noise  correlation  matrix  representation  the  intrinsic  term  is: 

c  -  \  4  z,  2;  A-  a; 

where,  Aj=  gate  parasitic  network  chain-matrix,  Z-,  =  intrinsic  FET  impedance 
matrix,  and  A;,  =  the  Z  to  chain-matrix  noise  transform  of  the  intrinsic  FET  and 
the  series  connected  source  network.  For  simplicity  (10)  is  rewritten  as: 


C  =  X  Y  ,  where  X  =  Z.  and  Y  =  X*  . 

Equating  (11)  with  (9)  provides  a  solution  for  the  M2,3,4  coefficients: 


1  (wC  f  1 

M2  =  +  ^11  +  X2,  +  Zj,  y,2Z^^Zgg„  -  — — 
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(13) 


M,  = 


1^12^21  ^12^22^gen  ■^22^21^««’  ^22^22^gen^l 


'gen 


s„ 


gen) 


(■^21^21“  ^22^n)^gen  ^  {^21^22  ~  ^22^n)^gen^g^ 

With  the  Ml  and  M23,4  determined,  the  coefficients  P,  R,  and  C  are  determined 
from  the  noise  figure  data,  which  is  modeled  by  a  least  squares  fitting  of  a 
linearized  form  of  equation  (2).  Once  the  internal  noise  sources  are  de-embedded 
and  characterized,  the  extrinsic  FET  correlation  matrix  is  determined  over  the 
simulation  frequency  range  by  embedding  the  characterized  sources  within  the 
noise  model  as  presented  in  the  previous  section. 

This  development  can  be  quickly  extended  to  a  four  parameter  model  with 
noise  coefficient  C  =  c'  +  jC  .  The  resulting  model  would  include  a  value  Mj 
corresponding  to  the  real  part,  (C)  of  the  coefficient  C.  Additionally,  the  two 
parameter  model  of  Pospieszalski’  could  be  implemented  with  parameters  T^  and  T^ 
representing  equivalent  gate  and  drain  noise  temperatures.  Coupled  with  the 
direct  noise  parameter  solution,  a  general  noise  modeling  algorithm  has  been 
developed  that  can  be  used  to  implement  various  noise  source  representation 
within  the  single  source  state  modeling  method. 


(14) 
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IV.  Small  Signal  Modeling/Extraction/Extrapolation 
Measurements  and  extraction  procedures  required  to  develop  accurate 
millimeter-wave  FET  small  signal  models  are  discussed  in  this  section.  An 
accurate  extraction  of  small  signal  equivalent  circuit  parameter  values  (ECP) 
relates  directly  to  the  integrity  of  the  resulting  noise  model.  Here  a  brief 
discussion  of  several  small  signal  model  investigations  is  first  given.  Then  a 
following  section  presents  the  results  of  a  measurement  based  modeling  method. 

How  high  in  frequency  can  the  extracted  ECPs  be  used  for  extrapolation  of 
S-parameters  beyond  the  range  of  measured  data?  At  issue  is  the  model  and 
measurement  accuracy  and  how  sensitive  the  resulting  representation  is.  Early 
work  by  Yeom  et.  al'®  and  Curtice  and  Camisa"  describes  the  resulting  quality  of 
an  extrapolation  versus  frequency  and  suggest  an  accurate  predictability  up  to 
50  GHz  for  an  optimized  model.  Other  work  by  Vaitus'^  investigated  S-parameter 
sensitivity  to  variation  of  the  model  ECP  values,  but  the  modeling  lacked  a 
physical  significance  with  which  to  establish  a  means  of  calculating  an  expected 
error  versus  an  extrapolation  of  S-parameters  over  frequency.  While  the  more 
recent  extraction  methods  for  determining  model  ECP  values  have  improved  the 
confidence  of  the  small  signal  modeling  procedure  in  predicting  S-parameters, 
the  uncertainties  in  these  techniques  are  critical  to  the  extrapolation  of  device 
performance.  One  of  these  uncertainties  addressed  here  is  the  "Cold  FET" 
extraction  of  extrinsic  resistance  values. 

Continuing  with  a  review  of  FET  small  signal  model  extraction,  it  is 
notable  that  uncertain  or  inadequate  models  mis-calculate  certain  figures  of 
merit.  Several  groups  including  work  by  Trew  and  Steer'^  and  H.  Vickes’"*  have 
investigated  not  only  the  modeling  of  S-parameters,  but  have  included  k-f actor, 
unity  current  gain  frequency  f, ,  and  maximum  frequency  of  oscillation  F^^  . 

These  parameters  are  used  as  figures  of  merit.  Successful  prediction  of  these 
quantities  is  not  guaranteed  by  an  S-parameter  fit,  particularly  when  measured 
data  is  taken  below  the  expected  value  of  a  figure  such  as  F^^.  More  detailed 
studies  in  this  respect  include  work  by  Vickes*,  Hughes  and  Tasker'^,  P.  Roblin'  , 
and  M.  Feng'^  which,  provide  measurement  and  modeling  techniques  that  highlight 
the  effects  of  the  parasitic  ECP  values  on  model  performance  and  frequency 
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extrapolations.  In  particular  a  general  partitioning  of  the  gain  between  the 
intrinsic  and  extrinsic  circuit  shows  the  ability  to  predict  several  possible 
"regions"  for  device  performance  at  extrapolated  frequencies  due  to  the 
interrelation  of  the  intrinsic  and  extrinsic  elements^.  This  modeling  approach 
is  supported  both  by  actual  device  measurements^  and  by  physics  based  numerical 
simulation’. 

In  Fulfilling  our  aim  to  further  the  development  of  the  noise  modeling 
procedure',  an  ECP  extraction  procedure  is  used  along  with  on-wafer  S-parameter 
measurements.  As  with  all  the  present  extraction  methods,  an  improvement  over 
"brute  force  optimization"  is  required  to  determine  accurate  parasitic  element 
values.  A  review  of  published  extraction  techniques  reveals  that  no  single 
technique  appears  suited  for  all  devices.  This  is  particularly  true  for  the  case 
of  physical  differences,  such  as  those  between  HEMTs  and  MESFETs,  are  considered. 
What  is  sought  is  a  method  which  allows  an  extraction  of  parameter  values  that 
remain  valid  under  frequency  extrapolation  of  device  small  signal  and  noise 
performance.  Also  important,  is  an  understanding  of  the  uncertainty  of  the 
extracted  values  and  also  an  understanding  of  the  sensitivity  of  model 
predictions  to  the  element  variations. 

FET  Equivalent  Circuit  Parameter  Extraction 


Improvement  of  the  noise  modeling  method  is  expected  through  implementation 
of  direct  extraction  of  equivalent  circuit  parameter  values  (ECPs).  Extraction 
of  FET  equivalent  circuit  parameter  values  from  S-parameter  data,  using  forward 
gate  bias  along  with  normal  operating  bias  conditions  on  the  FET,  are  referred 
to  here  as  "Cold  FET"  extractions.  As  demonstrated  by  Dambrine  et.  al.‘*  and 
others.  Improvements”  to  Cold  FET  techniques  have  shown  frequency  independent 
parameter  extraction  to  26.5  GHz.  Limitations  for  application  of  these  methods 
to  short  gate  length  devices  and  HEMTs  are  attributed  to  the  sensitivity  (or  lack 
thereof)  in  the  S-parameter  measurements'®. 

The  USF^‘  extraction  technique  is  first  demonstrated  here  on  a  0.5  x  600pm 
GaAs  MESFET  provided  by  M/A-COM.  While  this  device  is  designed  with  a  power 
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process,  it  was  provided  with  process  parameter  data  that  allows  an  investigation 
of  the  extraction  technique.  The  technique  employed  is  similar  to  that  described 
by  Dambrine  and  proceeds  as  follows: 

1.  Perform  Cold  FET  measurements.  The  Cold  FET  bias  levels  used  are 

between  11.0  mA  and  18.0  mA  of  gate  current.  Assume  that  the  channel 
resistance,  is  negligible  (this  will  be  discussed  in  detail 

later).  Then,  extrapolate  a  value  of  Rj+Rg  from  a  plot  of  Re[Zll] 
versus  1/Ig. 

2.  Extract  parasitic  resistances  from  the  highest  forward  biased  data 
set. 

3.  Then  for  this  example,  use  the  R,+Rg  value  that  was  extrapolated  in 
step  1.  to  extract  an  approximate  value  of  the  channel  resistance, R^. 
Where  the  values  of  R,  and  R^  are  supplied  in  process  data  sheets. 

While  process  variation  of  the  internal  MESFET  may  be  expected,  the  extrinsic 
inductance  and  resistance  values  are  assumed  accurate.  A  second  small  signal 
model  is  also  extracted,  for  comparisons,  from  measured  S-parameter  data,  but  the 
supplied  parasitic  resistance  values  are  used  in  the  analysis.  Proceeding  with 
step  1,  from  the  extrapolation  to  the  origin  of  the  Re[Zll]  versus  1/Ig  the 
value  Rg+R,+R^/3  is  obtained.  A  value  of  Rg+Rs=  3.2  Ohms  was  determined,  for  R^ 
=  0.0  as  an  initial  approximation.  The  extraction  of  the  parasitic  values  R,, 
Rg,  Rj,  L,,  Lj  and  Lg  proceeded  with  the  values  given  in  Table  1.  Using  the  M/A- 
COM  supplied  extrinsic  resistances  a  value  of  the  channel  resistance  is 
calculated  from  +  RJ?)  =  3.2,  such  that  =  3.81  Ohms.  This  value  is 

large  enough  to  affect  a  mis-calculation  of  the  drain  parasitic  resistance,  R^, 
which  is  dependent  upon  the  full  value  of  R^  not  Rd,/3,  see  Dambrine'*.  The 
resulting  model  parameter  values  are  compared  in  Table  1  versus  the  typical 
process  parameter  values  supplied  by  M/A-COM  and  the  set  of  parameters  derived 
from  the  M/A-COM  extrinsic  resistance  values  and  the  USF  measured  S-parameter 
data.  Comparing  the  two  extracted  parameter  data  sets  the  values  of  g„,  R^,.  and 
T  are  most  affected  by  the  modeling  of  the  extrinsic  resistances.  The 

extracted  inductance  values  agree  well  with  the  supplier  process  data.  It  is 
apparent  that  the  un-modeled  channel  resistance  in  the  USF  extrinsic  modeling 
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technique  is  significant  and  this  resulted  in  a  larger  S-parameter  modeling 


error. 


Table  1  Small  Signal  Model  ECP  Values;  Compares  supplier  average  process 
values  versus  USF  extraction  using  supplier  extrinsic  resistance  values  and 
also  baseline  USF  extraction  of  both  parasitic  and  intrinsic  element  values. 


Parameter 

Normal  Bias 
Conditions 

M/A-COM 

Process 

Data 

USF/ 

M/A-COM 

USF 

Extracted 

Unit 

4V,  50%  Id,. 

0.606 

0.649 

0.716 

pF 

Cgd 

0.048 

0.052 

0.050 

pF 

Cd, 

0.150 

0.162 

0.182 

pF 

Ri 

0.732 

1.532 

1.203 

Ohm 

9m 

78.0 

66.878 

73.5 

mS 

Rds 

135 

175 

148 

Ohms 

Td 

4.0 

3.36 

2.88 

pSec 

R. 

0.75 

0.75 

1.936 

Ohms 

Rd 

1.104 

1.104 

2.88 

Ohms 

^8 

1.18 

1.18 

1.264 

Ohms 

L. 

0.003 

0.0065 

0.006 

nH 

Ld 

0.05 

0.060 

0.060 

nH 

1-8 

0.04 

0.046 

0.046 

nH 

Avg.  S-parameter  Modeling 

0.11142 

0.00410 

0.01066 

Error 


The  resulting  FET  model  is  compared  in  Figure  5a.  versus  the  measured  data.  The 
difference  in  modeled  S-parameters  is  increasing  with  frequency.  K-f actor,  !h2il, 
and  unilateral  gain,  U,  compare  well  as  shown  in  Figures  5b.  and  5c. 


Results  for  a  low  noise  0.5/im  MESFET  are  given  in  Table  2.  Here  the 
channel  resistance  value  was  initially  set  to  zero,  but  due  to  a  lack  of 
parameter  process  data  was  determined  through  a  numerical  search.  A  value  of 
=  1.3  Ohms  was  used  for  the  extraction  and  the  average  S-parameter  modeling  error 
reflects  the  improvement  noted  in  the  M/A-COM  device  model.  Further  optimization 
given  in  Table  2.  does  show  a  significant  (2  Ohm)  change  in  the  extracted  value 
of  R<i,  but  it  is  suspected  that  a  frequency  dependence  that  is  observed  in 
several  of  the  intrinsic  parameters  (particularly  Rj,)  may  affect  the 
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Figure  5b  Extracted  Small  Signal  Model  Performance  (MESFET)  versus 
measured  data. 
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procedure.  An  improved  fit  of  Sji  is  achieved  as  a  result  of  optimization,  using 
the  extracted  model  as  a  starting  point. 


Table  2  Small  Signal  Model  ECP  Values  (MESFET  LN300) ; 

Compares  optimized  ECP  values  versus  USF  extraction. _ _ 


Parameter 

Normal  Bias 

USF 

USF 

Unit 

Conditions 

— 

Extracted 

Optimized 

Extracted 

251.8  pF 

39.74  pF 

105.88  pF 

3.335  Ohm 

32.63  mS 

328.33  Ohms 

3.0  pSec 


R, 

2.75 

2.065 

Ohms 

Rd 

2.80 

0.813 

Ohms 

1.18 

1.6 

Ohms 

L, 

2.89 

2.065 

nH 

Ld 

68.8 

60.04 

nH 

Lg 

63.8 

61.77 

nH 

Avg.  S-parameter  Modeling 
Error 

0.0067 

0.00054 

The  resulting  FET  model  compares  well  with  the  measured  data. 

Results  for  a  0.25^m  PHEMT  are  given  in  Table  3.  Cold  FET  extrinsic  values 
were  extracted  at  15.0mA  of  gate  current  or  3.0  X  10*A/m^.  For  a  zero  valued 
channel  resistance  the  extracted  values  of  the  extrinsic  resistances  are  given 
in  Table  3.  under  the  USF  Extracted  column  heading.  The  phase  of  S22  and  the 
magnitude  of  S,,  differed  from  the  measured  data  by  4%.  Additionally,  the 
unilateral  gain  and  K-f actor  deviated  from  the  measured  data. 

In  this  case  the  initial  error  function  was  comparable  to  that  achieved  for 
the  M/A-COM  device  when  the  process  determined  extrinsic  values  were  used  for 
modeling.  Therefore,  the  un-modeled  channel  resistance  and/or  un-modeled  effects 
at  the  PHEMT  source  and  drain  contacts  were  considered  to  be  less  dominant.  That 
is  the  values  of  the  extrinsic  resistances  are  large  compared  to  the  value  of  the 
channel  resistance  and  its  fractions.  The  S-parameters  are  predicted  accurately 
(Figure  6a  and  6b),  but  the  unilateral  gain  and  K-factor  differ  from  the  measured 


4V,  20.0mA 


R; 


265.0 

37.0 

114.0 

2.9 

35.0 

289.8 

2.4 
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data.  A  brief  gradient  optimization  of  the  model  (less  than  15  iterations) 
quickly  reduced  these  differences  in  the  S-parameters  k-factor  and  unilateral 
gain. 


Table  3  Small  Signal  Model  ECP  Values  (PHEMT) :  Compares 

optimized  ECP  values  versus  USE  extraction. _ _ 


Parameter 

Normal  Bias 
Conditions 

Brute 

Force 

Optimized 

USF 

Extracted/ 
Opt imized 

Unit 

3V,  13.0mA 

89.7 

105.51 

pF 

21.12 

16.26 

pF 

Cds 

29.82 

48.51 

pF 

Ri 

1.26 

1.44 

Ohm 

gm 

40.23 

45.84 

mS 

Hds 

373.26 

305.24 

Ohms 

Td 

0.82 

0.348 

pSec 

R, 

1.32 

4.47 

Ohms 

Rd 

2.64 

6.39 

Ohms 

Ri 

4.29 

4.22 

Ohms 

L. 

6.10 

3.32 

nH 

Ld 

66.37 

50.34 

nH 

Li 

49.19 

51.57 

nH 

Avg.  S-parameter  Modeling 
Error 

0.0026 

0.00029 

Discussion 

A  small  signal  modeling  procedure  was  investigated  to  update  the  "brute 
force"  optimization  procedure  followed  in  the  previous  noise  modeling  work'.  The 
determination  of  equivalent  circuit  parameter  values  by  a  method  similar  to 
Dambrine  et.  al.'*’^‘  was  experimentally  tested.  The  final  resulting  models  are 
considered  to  be  more  accurate  for  noise  modeling  purposes,  because  the  extrinsic 
element  values  are  determined  directly  from  measured  data.  These  cold  FET 
extracted  models  have  a  lower  average  S-parameter  modeling  error  than  the  brute 
force  optimized  models.  Consistency  in  the  ability  of  the  Cold  FET  extracted 
models  to  predict  and  extrapolate  small  signal  gain  parameters,  such  as  the 
unilateral  gain  is  shown. 

The  uncertainty  that  exists  in  the  extraction  of  parasitic  resistance 
values  was  investigated,  for  Cold  FET  measurements.  As  Dambrine  et.  al.'^  note. 
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Figure  6a  Extracted  Small  Signal 
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an  additional  relation  (technological  process  data)  or  measurement  (DC  data)  is 
needed  to  account  for  the  channel  resistance,  '^he  determination  of  the 
parasitic  resistance  is  uncertain,  because  the  Cold  FET  technique  establishes  a 
system  of  three  independent  equations'*,  but  along  with  the  parasitic  resistors 
the  channel  resistance  is  a  fourth  unknown.  Without  additional  information  about 
R;,!,  a  numerical  search  for  appropriate  values  was  investigated. 

Additional  measurements  in  this  regard  are  suggested,  although  for  the  case 
of  the  PHEMT  the  standard  DC  measurements  may  be  inadequate.  The  concern  here  is 
to  avoid  parallel  conduction  in  the  doped  layer  (AlGaAs  for  GaAs  HEMTs) [H. 
Morkoc].  Of  the  published  literature  the  work  of  Fu  et.  al.  directly  addresses 
the  measurement  and  modeling  of  short  (length  <  0.1)jm)  gate  length  HEMTs. 

Additional  uncertainty  also  exists  in  choosing  the  extracted  values  from 
a  set  of  data  that  exhibits  a  dispersion  of  the  parameter  values  versus 
frequency.  While  the  extrinsic  resistance  data  (Re[Zll])  may  vary  by  less  than 
0.5  Ohms  from  1.0  to  26.0  Ghz,  the  values  of  Rd3  may  vary  by  30%.  Also,  a  better 
"choice"  of  the  extracted  value  versus  frequency  is  needed,  as  shown  with  the 
PHEMT,  where  slight  optimization  changes  were  needed  to  improve  the  overall  model 
performance. 
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Noise  Parameter  Sensi'tivitv  Analysis 

In  the  previous  work'  the  noise  modeling  algorithm  was  highly  dependent 
upon  the  input  data  set.  This  data  set  consists  of  the  source  reflection 
coefficient  and  the  associated  FET  noise  figure  measured  at  several  frequencies. 
This  work  initiates  an  investigation  of  the  sensitivity  of  the  data  processing 
required  to  determine  the  coefficients  P,  R,  and  C.  Initially/  a  perturbation 
of  the  input  data  set  parameters  was  investigated. 

The  results  of  a  perturbed  set  of  nominally  50  Ohm  source  impedance  noise 
figure  measurements  is  given  in  Figure  8.  In  this  case  the  magnitude  and  angle 
of  the  reflection  coefficient  and  the  noise  figure  were  separately  varied  by  10%. 
The  variation  in  the  noise  coefficients  was  then  plotted  versus  variations  in  the 
input  parameters. 


Our  investigation  of  noise  parameter  measurement  sensitivity  continues  with 
the  least-squares  fitting  procedure  described  by  Lane^.  It  is  known  that  the 
matrix  of  equations,  used  to  solve  for  the  coefficients  of  Lane's  linearized  form 
of  the  noise  figure  equation,  could  be  ill-conditioned^.  An  IRE  Subcommittee  had 
also  published  standards  on  measuring  noise  in  linear  two-ports  which  defined  a 
procedure  for  tuner  based  measurements  that  helped  avoid  such  ill-conditioning^. 

B0fore  proceeding,  it  is  often  noted  in  mathematical  treatments  of  such 
problems  that  certain  data  sets  may  result  in  matrix  singularity.  Ill- 
conditioning  is  a  more  qualitative  reference  to  the  case  where,  although  the 
matrix  is  not  singular,  the  data  may  be  related  to  the  singular  matrix  such  that 
It  Is  ill-conditioned  to  complete  the  analysis  without  errors.  Therefore,  the 
degree  of  ill-conditioning  may  result  in  an  inaccurate  measurement  of  the  noise 
parameters  and  in  the  worst  case  no  solution  at  all  (a  singular  coefficient 
matrix) . 

In  determining  the  noise  parameters.  Lane  suggested  that  improvement  in  the 
data  processing  technique  was  needed  to  lower  error  sensitivity.  Sannino  then 
reported  on  an  investigation  of  loci  of  Y,  on  the  Smith  Chart  which  produced 
singularities  and  experimentally  exhibited  an  updated  procedure  for  selecting 
source  tuner  states^^.  It  was  concluded  that  a  measurement  of  data  from  tuner 
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Figure  7 


Variation  of  coefficients  P,  R,  and  C  versus  10%  variation  in  the 
the  50  Ohm  input  tuner  data  set  (noise  figure,  source  state 
magnitude  and  angle). 


states  that  belonged  to  at  least  two  families  of  the  singular  loci  was  adequate 
for  a  solution  that  characterized  the  surface  described  by  the  noise  figure 
equation.  The  singular  loci  are  circles  on  the  G,,  B,  plane  of  the  Smith  Chart. 
One  of  the  singular  locus  conditions  is  when  G,,  B,  and  |Yj^  equal  constant 
values.  Also  at  this  time,  Mitama  and  Katoh^  improved  the  least-squares  fitting 
algorithm  of  Lane  by  accounting  for  the  least-squared  error  due  to  the  source 
state  components  G,  and  B,. 

More  recently,  O'Callaghan  and  Mondal  used  a  vectored  error  correction 
approach  was  applied  to  the  measurement  of  noise  parameters"'*.  The  resulting 
method  uses  a  linearized  noise  figure  equation  similar  to  that  of  Mitama  and 
Katoh,  that  is  the  error  sensitivity  due  to  noise  figure  as  well  as  G,  and  B,  is 
included.  Because  the  vectored  method  is  built  upon  a  projection  theorem 
argument,  a  quantitative  as  well  as  qualitative  analysis  of  the  error  sensitivity 
is  shown.  A  coefficient  matrix  for  the  linearized  set  of  equations  is  developed. 
However,  O' Callaghan  and  Mondal  proceed  to  solve  for  the  coefficients  of  the 
linearized  equations  in  closed  form.  The  least  squares  fit  of  these  coefficients 
is  avoided  by  a  decomposition  of  the  matrix  (Cholesky's  Method).  The  singularity 
of  the  matrix  can  then  be  addressed  by  standard  vector  methods.  Additionally, 
the  type  of  successful  vectored  error  correction  that  is  common  to  S-parameter 
network  analyzer  calibration  is  applied  to  the  projection  of  the  vectors  which 
comprise  the  linearized  noise  figure  equation.  The  result  is  that  a  quantitative 
analysis  of  the  measurement  error  sensitivity  and  the  degree  of  ill-conditioning 
of  the  equation  matrix  is  addressed. 

Upon  reviewing  the  details  of  these  analyses  several  questions  concerning 
the  single  tuner  state  algorithm  are  given  as  follows: 

1.  Will  a  least-squares  error  fit  that  also  includes  Y,  errors  prevent  model 
coefficients  P,  R,  and  C  from  being  non-physical  (negative  values  or  C  >  1)? 

2.  What  are  the  linearized  equation  matrix  conditions  on  50  ohm  source  state 
data  and  on  a  data  set  from  a  constant  arbitrary  state,  T,? 

3.  What  are  the  effects  on  the  conditioning  of  the  matrix  when  using  a  fewer 
number  of  frequencies  (data  points)  in  the  present  modeling  procedure? 

A  complete  sensitivity  analysis  of  the  noise  modeling  algorithm  presented  in 

Section  III  was  not  available  at  this  time.  Such  an  experimental  analysis 

requires  a  device  that  is  characterized  by  the  small-signal  modeling  technique 
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presented  in  Section  4.  However,  noise  parameters  for  a  MESFET  and  a  PHEMT  were 
measured  and  a  preliminary  analysis  gives  the  following  results; 

1.  A  study  on  the  ill-conditioning  of  the  coefficient  matrix  associated  with  the 
linearized  noise  figure  equation  was  done.  For  the  50  Ohm  source  state  the 
matrix  is  not  singular,  and  values  for  the  coefficients  P,  R,  and  C  are 
developed.  The  resulting  values  are  dependent  upon  the  size  of  the  data 

set . 

2.  In  the  case  that  the  data  set  is  chosen  with  the  magnitude  of  the  input 
reflection  coefficient, jr.l  ,  constant  the  matrix  is  singular.  For  the  case 
that  the  source  state  varies  about  the  Smith  Chart  a  solution  is  calculated, 
but  the  resulting  noise  parameters  vary  with  different  data  sets. 


VI.  Microwave  Measurements  and  Model  Comparisons 
Microwave  noise  parameter  measurements  performed  from  2.0  to  18.0  GHz  on  three 
devices  each  of  a  0.5  pm  GaAs  MESFET  and  a  0.25  pm  PHEMT.  The  ATN  NP5A  noise 
parameter  test-set  and  measurement  software  was  configured  at  U.S.F.  as  part  of 
this  work.  Results  for  the  MESFET  are  given  below. 
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The  plots  of  Figure  8  represent  the  measured  and  modeled  noise  parameters 
of  a  MESFET'  versus  frequency.  Each  model  is  derived  from  50  Ohm  tuner  state 
data.  P,  R,  and  C  are  determined  using  the  same  algorithm  for  each  model.  A 
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direct  solution  of  the  noise  parameters  is  compared  to  the  "synthesized 
measurement"  of  the  previous  algorithm.  A  subroutine  uses  correlation  matrices 
to  provide  the  solution.  Inputs  to  this  subroutine  are  frequency,  P,  R,  C  and 
the  FET  equivalent  circuit  parameter  values. 
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VIII. 


Millimeter-wave  Noise  Modeling  Conclusions  and _ Extensions 


In  furthering  the  development  of  the  noise  modeling  method  discussed  in 
this  report,  topics  of  continuing  interest  include; 

1.  Verification  of  the  method  at  millimeter-wave  frequencies. 

2.  A  continued  analysis  of  the  least  squares  determination  of  coefficients  P, 
R,  and  C  and  their  sensitivities. 

3.  Small  signal  modeling  improvements  in  the  determination  of  the  parasitic 
resistances. 

4.  An  experimental  investigation  of  the  temperature  dependencies  of  the 
intrinsic  FET  model  noise  sources. 

Item  (2)  may  be  approached  in  two  ways.  One  investigation  would  study  the 
conditioning  of  the  coefficient  matrix  associated  with  the  least  squares  solution 
of  P,  R,  and  C.  As  a  result,  a  consistently  accurate  procedure  for  noise 
parameter  modeling  with  the  single  source  state  method  would  be  defined.  The 
other  area  of  approach  concerns  the  approximation  of  the  noise  figure  equation 
in  terms  of  P,  R,  and  C.  Here  it  has  been  linearized  for  the  least  squares  fit. 
This  is  of  interest  for  measurement  sensitivities,  because  the  actual  non-linear 
system  of  equations  will  have  different  singularities  from  the  linearized 
equation. 

As  part  of  this  investigation  several  DC  and  lower  frequency  measurement 
techniques  were  studied  for  the  determination  of  the  FET  parasitic  resistances. 
The  experimental  implementation  of  such  a  measurement  is  a  natural  extension  of 
this  work,  that  will  experimentally  provide  the  additional  information  needed  to 
complete  a  small  signal  model  extraction. 

The  temperature  dependence  of  the  intrinsic  noise  sources  is  of  interest 
to  U.  S.  F.  and  the  Air  Force.  This  work  helps  to  address  the  goal  of 
determining  the  noise  source  temperature  dependencies.  In  particular,  the 
general  model  derived  here  allows  the  implementation  and  study  of  both  new  and 
existing  intrinsic  model  noise  source  representations. 

VIII.  Summary  and  Conclusions 

This  noise  modeling  investigation  addresses  several  items  necessary  to  the 
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development  of  a  single  source  state  noise  modeling  method.  Emphasis  was  placed 
on  a  general  formulation  of  the  model.  This  resulted  in  the  derivation  of  an 
efficient  noise  modeling  algorithm  from  noise  correlation  matrix  methods.  The 
algorithm  is  useful  for  CAD  implementation,  because  the  model  topology  and 
intrinsic  noise  sources  are  described  by  an  interconnection  of  cascaded  matrices. 
The  small  signal  model  extraction  procedure  adopted  here  updates  the  "brute 
force"  optimization  previously  used  to  determine  model  ECP  values.  Small  signal 
models  were  extracted  for  both  a  MESFET  and  a  PHEMT  with  good  agreement  to 
measured  data.  In  addition,  a  conventional  on-wafer  noise  parameter  measurement 
system  and  related  software  was  configured  at  U.  S.  F.  Noise  parameter 
measurements  were  performed  with  this  conventional  system  over  the  2  to  18  GHz 
range.  For  the  present  single  source  state  model,  as  with  conventional 
measurement  techniques,  a  sensitivity  to  measurement  errors  and  coefficient 
matrix  conditioning  was  studied.  Initiation  and  continuation  of  this  sensitivity 
analysis  is  needed  to  improve  the  least  squares  fitting  procedure.  As  a  result, 
certain  data  sets  were  identified  with  respect  to  the  ill-conditioning  of  the 
solution,  and  further  steps  towards  realizing  a  consistently  accurate  modeling 
procedure  were  presented. 
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Abstract 

This  research  involved  increasing  the  capabilities  of  the  Sinusoidal  Transform  Coder 
(STC),  a  low  bit  rate  digital  speech  coder.  During  this  project,  the  STC  software  was  revised 
to  allow  easier  modification  during  this  project  and  in  future  research.  New  EFT  windowing 
techniques  were  studied  in  order  to  reduce  the  processing  delay  of  the  STC.  Multirate 
parameter  transformation  techniques  were  studied,  and  coder  configuration  parameters  for 
various  rates  were  selected.  Finally,  a  voice  mail  application  which  demonstrates  the  utility 
of  the  transformation  techniques  for  speech  data  compression  and  storage  was  developed. 
This  application,  which  can  be  used  by  any  host  running  TCP/IP  and  directly  connected 
to  the  Internet,  allows  voice  messages  to  be  transmitted  from  a  sender  to  a  recipient  host 
using  a  novel  negotiation  protocol. 
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Introduction 

This  report  describes  research  performed  during  the  AFOSR  Research  Initiation  Pro¬ 
gram  related  to  digital  speech  coding  using  sinusoidal  representations.  This  work  involved 
modifications  and  extensions  to  the  Sinusoidal  Transform  Coder  (STC)  developed  by  R.  J. 
McAulay  at  MIT  Lincoln  Laboratory  under  the  support  of  the  Rome  Laboratories  Digital 
Speech  Laboratory  [6,  8, 9, 10].  The  STC  modifications  and  extensions  provide  significant 
increases  in  flexibility  and  utility,  as  well  as  usability  of  the  code  for  research  purposes. 

This  work  concentrated  on  two  related  areas.  The  first  of  these  areas  is  the  multi¬ 
stage  transformation  of  encoded  speech  from  one  rate  to  a  lower  rate.  This  transformation 
technique  provides  flexibility  not  available  in  other  low  and  medium  bit  rate  coders.  During 
this  project,  the  existing  STC  software,  including  the  transformation  code,  was  restructured 
to  allow  future  modifications  to  be  made  in  a  straightforward  manner.  In  addition,  vector 
quantization  of  the  spectrum  was  added  in  order  to  track  ongoing  work  at  MIT  Lincoln 
Laboratory.  FFT  windowing  techniques  for  a  low  delay  coder  were  explored,  as  was 
selection  of  parameters  for  a  medium  rate  STC  coder.  The  second  major  area  of  effort  for 
this  project  was  the  development  of  a  voice  mail  application  for  use  on  the  Internet.  This 
application  demonstrates  the  utility  of  the  coding  and  transformation  techniques  for  data 
storage  management.  This  application  runs  under  the  UNIX  ®  operating  system  on  most 
common  workstations.  It  allows  voice-based  messages  to  be  exchanged  over  data  networks 
and  stored  efficiently  on  the  recipient  computers. 

®IJNIX  is  a  registered  trademark  of  AT&T. 
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Background^ 

Speech  communications  systems  can  be  evaluated  on  the  basis  of  many  interrelated 
criteria,  including  intelligibility,  perceived  quality,  and  robustness.  Several  of  these  mea¬ 
sures  are  clearly  more  significant  than  others,  but  clearly  speech  coding  algorithms  do  not 
exist  in  isolation  from  the  entire  voice  communications  system.  It  is  therefore  important 
that  flexibility  of  application  and  robustness  to  severe  data  transmission  environments  be 
considered  when  evaluating  speech  coding  methods. 

Speech  Coding  via  Sinusoidal  Representations 

A  variety  of  voice  communications  systems  require  data  rate  compression  of  the  speech 
signal.  Approaches  to  this  problem  can  be  separated  into  two  categories,  algorithms  which 
make  few  assumptions  about  the  nature  of  the  input  data,  such  as  adaptive  Huffman  or 
Lempel-Ziv-Welsh  (LZW)  coding,  and  those  which  are  optimized  for  the  expected  data,  in 
this  case  speech,  such  as  ADPCM  and  LPC-based  coders.  The  first  type  of  compression 
technique  reduces  the  memory  storage  requirements  with  no  information  loss,  while  the 
second  type  involves  some  reduction  in  information  as  compared  to  the  original  data.  Both 
types  of  coders  reduce  the  resource  usage  by  a  fixed  percentage  for  specified  data.  The 
coder  used  in  this  project  allows  for  resource  usage  to  be  balanced  against  speech  quality 
in  a  dynamic  manner. 

The  basic  coding  technology  used  in  this  project  is  the  Sinusoidal  Transform  Coder 
(STC)  algorithm  [6, 7,  8, 9, 10].  The  STC  model  has  been  shown  to  be  effective  at  various 
discrete  rates  from  8  kb/s  to  2.4  kb/s  [9].  Further,  this  coding  technology  is  extremely 
robust  to  manipulation  in  the  parameter  space,  as  shown  by  the  previous  work  in  STC 
parameter  space  transformations  [2].  The  parameter  space  transformation  method  allows 
the  most  optimum  coding  for  this  coder  type  to  be  used  for  a  given  compression  ratio.  The 

'Portions  of  this  section  are  drawn  from  [4]. 
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technique  which  forms  the  basis  of  this  work  allows  compression  in  multiple  stages  of  any 
selected  size,  with  no  interaction  with  the  source. 

Multirate  Parameter  Transformations 

Because  of  the  robust  nature  of  the  representation  used  by  the  STC  algorithm,  not  only 
can  the  coder  operate  at  various  rates  [9],  but  transformations  in  the  parameter  space  can 
be  used  to  change  the  bit  rate  of  the  coded  data. 

The  parameter  transformation  technique  allows  the  coding  for  all  spectrum,  excitation, 
and  frame  length  parameters  to  be  selected  at  each  rate  to  give  the  optimum  coding  for  that 
rate.  Among  the  parameters  which  may  be  varied  at  each  rate  are  the  number  of  cepstral 
coefficients,  the  frequency  range  for  coding,  the  method  of  representation  (frame-fill  or 
the  actual  value)  of  the  mid-frame  excitation  parameters,  and  the  number  of  bits  used  for 
each  parameter.  The  method  for  changing  the  spectral  representation  (that  is,  the  number 
of  cepstral  coefficients)  is  shown  in  Figure  1.  The  cepstral  coefficients  derived  from  the 
coded  parameters  are  used  to  derive  a  spectral  envelope,  which  is  then  used  to  generate 
a  new  set  of  cepstral  coefficients,  which  can  then  be  coded.  The  parameter  temporal 
interpolation  technique  used  for  frame  length  changes  at  each  stage  is  depicted  in  Figure 
2.  The  various  parameters  at  each  new  frame  boundary  (that  is,  at  are 

linearly  interpolated  according  to  the  distance  of  the  new  frame  boundaries  from  the  closest 
old  edges  (Tic_i,Tk,Tk+i,Tk+2).  Provision  is  also  made  for  interpolating  mid-frame  and 
frame-fill  parameters  in  a  similar  manner. 

Intelligibility  tests  have  verified  that  the  transformation  algorithms  were  producing 
output  speech  of  good  quality.  The  2.4  kb/s  test  data  was  generated  by  performing  the 
initial  STC  analysis  at  a  16  kb/s  rate  with  a  10  ms  frame  size.  The  rate  was  then  reduced 
via  the  transformation  to  8  kb/s  with  a  20  ms  frame  length,  then  to  4.8  kb/s  with  a  20  ms 
frame,  and  finally  to  2.4  kb/s  with  a  30  ms  frame  length.  The  results  are  summarized  in 
Table  1.  Most  listeners  find  the  the  perceived  speech  quality  to  be  good,  only  slightly  less 
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Figure  1:  Technique  for  Changing  the  Spectral  Representation 
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Figure  2;  Interpolation  of  Parameters  for  New  Frame  Lengths 
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Table  1;  DRT  Test  Results 


Speaker(s) 

Results 

JE 

91.93 

CH 

95.44 

RH 

91.67 

Total  for  3  Males 

93.01 

pleasing  than  the  untransformed  2.4  kb/s  STC. 

Applications 

The  method  presented  above  is  ideal  for  many  applications.  A  typical  application  for 
this  multi-level  compression  technique,  a  voice  mail  system,  was  created  to  test  the  overall 
concept  [4].  This  project  extends  that  rough  prototype. 

Because  of  the  rapid  development  of  audio  hardware  for  personal  computer  systems  and 
workstations,  voice  mail  transmission  through  the  national  network  known  as  the  Internet 
is  becoming  common.  Almost  every  modem  workstation  and  personal  computer  system 
supports  some  type  of  audio  file  format.  For  such  systems,  corresponding  audio  hardware 
is  provided  to  record  and  play  audio  data. 

The  operation  of  a  typical  voice  mail  system  is  illustrated  in  Figure  3.  The  mail  is 
generated  by  a  sender  and  transmitted  to  the  recipient  via  a  computer  network.  The  storage 
system  processes  the  arriving  packets  to  insure  that  message  consistency  is  maintained. 
The  speech  data  is  then  passed  to  the  appropriate  decoder  and  played  on  the  recipient  host’s 
audio  interface. 

The  original  application  documented  in  [4]  used  the  Sun  Microsystems  Remote  Proce¬ 
dure  Call  (RPC)  protocol,  which  in  turn  used  the  standard  TCP/IP  protocol  stack  for  data 
transmission. 

There  are  two  types  of  file  formats;  "raw"  formats,  where  the  device  parameters  and 
encoding  are  fixed,  and  self-describing  formats  (e.g.,  .wav  format  used  by  Microsoft 
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Figure  3:  Voice  Mail  Application 

Windows),  where  the  device  parameters  and  encoding  are  made  explicit  in  some  form  of 
header.  The  following  table  presents  an  overview  of  popular  file  formats. 

The  main  characteristics  of  audio  hardware  are  described  by  the  sample  rate  (Hz)  and 
the  resolution.  For  current  audio  hardware,  the  sample  rates  vary  from  6-50  kHz  and  the 
resolution  is  8  or  16  bit  linear  or  ^-law.  The  audio  hardware  of  a  modem  computer  usually 
provides  multiple  sampling  rate  and  quantization  options. 

Table  2:  Some  audio  self-describing  file  formats 


Extension  or  Name 

Origin 

Parameters 

.au,  .snd,  .vox 

NeXT,  Sun 

rate,  #channels,  encoding,  info  string 

.aif(f),  .AIFF 

MAC 

rate,  #channels,  sample  width,  lots  of  info 

.iff,  .IFF/8SVX 

Amiga 

rate,  #channels,  instmment  info  (8  bits) 

.voc 

Soundblaster 

rate,  (8  bits/1  ch;  uses  silence  deletion) 

.sf 

IRCAM 

rate,  #channels,  encoding,  info 

HCOM 

MAC 

rate  (8  bits/1  ch;  uses  Huffman  compression) 

One  audio  format  may  be  supported  by  different  audio  hardware  and  be  implemented 
in  more  than  one  computer  system  with  a  different  range  of  parameters.  In  general,  it  is 
possible  to  define  a  conversion  procedure  between  almost  any  pair  of  file  formats  -  often 
with  loss  of  information,  however. 
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In  addition,  a  variety  of  data  compression  methods  (LZW,  ADPCM,  LPC  and  STC  [4]) 
are  used  to  reduce  the  requirement  of  the  network  bandwidth  and  hard  disk  storage.  It 
is  generally  impossible  to  convert  compressed  audio  files  to  different  audio  formats  or  a 
file  compressed  by  different  compression  methods  without  losing  significant  amounts  of 
information.  The  variety  of  audio  hardware,  audio  file  formats  and  compression  methods 
makes  the  exchange  of  audio  files  between  computers  on  the  Internet  very  complicated. 

In  comparison  with  e-mail,  voice  mail  transmission  requires  much  higher  throughput 
and  storage  (500  times  higher  than  the  e-mail  for  low  quality  voice  mail  and  ICKXX)  times 
higher  for  CD  quality  voice  mail).  Therefore,  throughput  and  storage  may  be  a  problem 
in  future  voice  mail  systems.  To  prevent  unnecessary  high  quality  audio  files  flooding  a 
system,  the  voice  protocol  must  be  able  to  get  information  about  out-going  audio  files  and 
determine  whether  to  accept  them  or  not  automatically. 

Existing  voice  mail  protocols  such  as  NVP  (Network  Voice  Protocol,  RFC741,  [3]) 
and  VFIP  (Voice  File  Interchange  Protocol,  RFC978,  [12])  were  not  designed  originally 
for  the  variety  of  hardware  and  audio  formats  of  current  personal  computer  systems  and 
workstation  systems.  MIME  (Multipurpose  Internet  Mail  Extensions,  [1])  was  designed 
for  future  multimedia  mail  communication.  It  is  not  clear  whether  MIME  can  provide  a 
common  base  for  modem  audio  voice  mail  and  support  the  wide  a  variety  of  audio  formats, 
at  least  in  the  near  term.  Furthermore,  available  commercial  voice  mail  packages  used 
on  the  Internet  are  usually  designed  to  convert  audio  files  from  one  commercial  format  to 
another  one  and  not  designed  to  check  compression  methods  in  order  to  transfer  compressed 
audio  files. 

It  is  therefore  important  that  complexity  and  diversity  of  audio  hardware  and  audio 
formats  used  in  the  Internet  environment  be  considered  when  a  new  voice  protocol  is 
designed. 
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Methodology 

The  two  main  thrusts  of  this  project,  coder  development  and  voice  mail  application 
development,  were  able  to  proceed  relatively  independently.  These  efforts  interacted  only 
during  the  instantiation  of  the  coder  into  the  voice  mail  application.  Different  personnel 
were  involved  in  the  segments. 

Speech  Coding  Methods 

The  STC  speech  coding  research  began  by  addressing  the  difficulty  of  working  with 
the  existing  STC  software.  The  STC  source  coder  has  been  used  as  a  research  tool  at  MIT 
Lincoln  Laboratory  and  Rome  Laboratories,  and  was  not  designed  to  be  production  code. 
As  a  result,  the  code  contains  the  remnants  of  many  years  of  research  which  are  not  relevant 
to  the  current  work.  The  code  was  thus  revised  to  reflect  current  requirements  and  the  need 
for  well-structured  code  appropriate  for  use  by  novice  researchers.  The  research  in  this 
area  then  proceeded  to  use  this  code  to  find  STC  parameter  settings  appropriate  for  medium 
(16  kb/s)  bit  rates.  Further,  this  code  was  used  to  investigate  the  performance  of  low  delay 
FFT  windows.  Informal  listening  tests  were  performed  to  evaluate  the  performance  of  the 
various  parameter  settings  as  well  as  the  windowing  functions. 

Voice  Mail  Application 

The  voice  mail  application  was  developed  in  the  UNIX  environment  commonly  avail¬ 
able  on  Sun  and  DEC  workstations.  The  code  development  was  segmented  according  to 
the  functional  blocks  necessary  for  the  application.  In  particular,  code  was  developed  and 
tested  for  the  server  and  client  daemons  for  the  mail  protocol  which  need  to  run  on  each 
host  supporting  this  system,  as  well  as  for  the  graphical  and  non-graphical  user  interface 
programs.  Complete  system  testing  has  also  been  performed,  and  more  extensive  off-site 
testing  is  being  investigated. 
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Results 


Speech  Coding  Research 

Coder  Revisions 

Several  modifications  were  made  to  the  source  code  of  the  STC  speech  coder  in  an 
attempt  to  improve  its  usability  by  researchers.  This  allowed  this  project  to  more  easily 
extend  the  original  version,  and  should  provide  benefits  to  future  research  in  this  area. 

The  source  code  was  restructured  for  improved  efficiency.  For  example,  all  parameters 
related  to  the  pitch  were  declared  as  members  of  a  structure  called  pitch.  The  voicing 
parameters  became  members  of  a  structure  called  probability,  etc.  The  structures  are 
declared  in  a  single  file  that  was  included  with  every  function.  Many  of  the  global  variables 
were  eliminated.  This  method  clarified  parameter  passing  to  various  routines. 

Most  simulations  were  based  on  zero  channel  spacing,  so  the  channel  warping  was 
eliminated. 

A  single  synthesizer  was  developed,  eliminating  the  need  for  a  separate  one  in  the  case 
of  rate  transformation.  It  has  been  tested  extensively  and  is  producing  correct  audio  speech 
files. 

Some  setup  files  have  been  eliminated  and  the  rest  configured  so  that  there  is  one  for 
the  analyzer,  one  for  synthesizer,  and  one  for  the  transformation.  For  this,  a  variable  called 
"prog-type"  has  been  defined  in  the  main  programs  where  prog-type  is  1  for  the  analyzer, 
2  for  the  synthesizer,  and  3  for  the  transformation.  This  allowed  the  identification  of  the 
parameters  passed  to  the  pitch  and  voicing  coding/decoding. 

Instead  of  a  single  median()  function,  2  separate  ones  were  created:  median-pitch()  and 
median-voicing.  This  was  a  consequence  of  the  requirement  for  proper  parameter  passing. 
The  functions  are  part  of  "fOsubs.c". 

All  STC  programs  now  use  a  single  channel  coding  program  "code_gs.c",  thus  elim¬ 
inating  all  the  ones  related  to  the  transform  programs.  This  necessitated  the  definition 
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of  a  pre-processor  variable  named  "XFORM".  This  variable  is  set  to  1  if  the  transform 
programs  are  used  and  0  if  they  are  not. 

All  STC  programs  now  use  uniform  data  files.  When  a  transform  program  is  used,  the 
data  file  for  the  initial  rate  is  entered  at  the  command  line  followed  by  the  data  file  for  the 
final  rate.  This  eliminated  the  need  for  the  two  extra  data  files  used  previously.  A  single 
line  is  included  in  the  data  file  indicating  whether  the  number  of  bits  will  be  specified  for 
the  final  rate  in  transform  programs,  so  this  in  combination  with  the  global  "XFORM" 
dictates  the  flow  of  control  in  this  data  file.  This  also  conforms  with  the  new  code_gs.c 
routine  structure. 

All  references  to  SPOX  have  been  deleted  form  the  updated  files,  and  so  goes  the 
graphics  code. 

The  vector  quantization  code  developed  by  MIT  Lincoln  Laboratory  for  use  in  their 
current  version  of  the  STC  was  added  to  the  new  version  developed  by  the  University  of 
Kansas,  so  that  this  major  breakthrough  could  be  available  in  the  restructured  software. 

Medium  Coder  Parameters 

Various  coder  parameters  were  informally  tested  for  a  medium  rate  coder.  While  various 
combinations  produced  good  speech  quality,  the  increase  in  quality  over  an  example  8  kb/s 
coder  was  not  great.  This  observation  implies  that  alternate  coding  schemes  such  as 
transmission  of  phase  information  should  be  used  to  exploit  the  additional  bandwidth. 
Attempts  to  implement  such  schemes  lead  to  the  previously  described  coder  restructuring 
effort.  Phase  coding  methods  continue  to  be  pursued. 

Low  Delay  Windows 

Because  of  the  robust  nature  of  the  representation  used  by  the  STC  algorithm,  not 
only  can  the  coder  operate  at  various  rates  [9],  but  transformations  in  the  parameter  space 
can  be  used  to  change  the  bit  rate  of  the  coded  data.  The  usefulness  of  the  STC  at  higher 
rates,  however,  is  somewhat  limited  by  the  large  delay  imposed  by  processing  requirements. 
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including  that  incurred  by  FFT  windowing.  In  order  to  address  this  concern,  an  asymmetric 
window  was  incorporated  into  the  code.  It  is  basically  a  two-pole  window  [5]  that  has  the 
form 


10(71)  =  6"  —  aft" 


(1) 


where  a  =  0.85,  bi  =  0.97,  and  62  =  0.95  have  been  shown  to  provide  good  performance. 
Its  z-transform  is 


W(  \  =  (t +  ^ 


(2) 


A  Matlab  program  was  written  in  order  to  analyze  the  frequency  and  phase  response 
of  the  window.  The  frequency  response  was  very  smooth  and  decreased  monotonically 
with  increasing  frequency.  No  ripple  was  observed.  The  phase  response  was  extremely 
close  to  linear.  At  lower  frequencies,  it  showed  a  slight  non-linearity,  but  this  is  irrelevant 
since  a  low-pass  filter  will  be  used.  Other  combinations  of  windows  (i.e.  rectangular  and 
Hamming)  were  tried  but  the  results  obtained  from  the  2-pole  window  produced  the  best 
linear  phase  approximation.  The  exponential  window’s  first  sidelobe  was  27  dB  less  than 
the  peak,  and  the  second  sidelobe  down  7  dB.  The  centroid  for  this  window  (480  points) 
was  calculated  to  be  at  (46.421,  0.082)  for  a  delay  of  5.80  ms  when  using  a  sampling 
frequency  of  8  kHz.  The  code  for  this  window  was  included  in  the  file  "stft-ana.c"  under 
the  anlwinO  function.  The  control  parameter  specified  in  the  data  file  will  allow  its  use 
("exp"  instead  of  "cos"  for  the  Hamming/cosine  window). 

The  window  can  be  shown  to  produce  an  increase  in  the  signal-to-noise  ratio  for 
most  sentences,  along  with  a  crispier  speech  than  the  speech  observed  previously  with  a 
Hamming  window  for  other  coders  [5].  The  speech  audio  files  resulting  from  the  2-pole 
window  produce  a  slight  chirping  at  the  end  of  each  utterance.  This  is  believed  to  be  as  a 
result  of  the  voicing  probability  produced  from  the  voicing  rule  function.  An  attempt  to 


3-13 


vary  the  hard-coded  parameters  within  each  voicing  rule,  such  as  the  SNR,  noise  level,  and 
the  upper  and  lower  limits  on  the  allowed  SNR  is  currently  underway. 

In  order  to  evaluate  the  delay  of  the  coder  with  the  new  window,  a  program  was  written 
to  produce  an  impulse  function,  where  a  single  value  was  non- zero.  This  impulse  was 
then  used  as  an  input  to  the  STC  coder  in  order  to  evaluate  the  delay  introduced  in  the 
coding  process.  The  delay  observed  from  running  simulations  on  a  speech  input  was  40 
ms.  However,  the  delay  observed  from  the  impulse  did  not  make  sense  because  the  result 
indicated  a  110  ms  delay.  The  SoundTool  program  available  on  SPARCstations  was  used 
to  determine  the  delay;  the  sample  at  which  a  speech  waveform  feature  appeared  was 
identified  and  the  time  to  that  sample  was  measured.  This  technique  is  not  very  accurate, 
but  it  was  the  only  one  available  at  the  moment. 

Internet  Voice  Mail  Application  Development 

A  new  voice  mail  protocol  which  guarantees  correct  transmission  of  audio  files  between 
computer  systems  with  different  audio  hardware  and  audio  formats  through  the  Internet 
was  developed.  A  voice  mail  application  based  on  this  protocol  was  implemented  on 
several  UNIX  workstation  systems.  This  voice  mail  application  also  provides  a  graphic 
user  interface  based  on  the  X- Windows  Athena  widget  set,  and  a  non-graphic  user  interface 
based  on  the  UNIX  "curses".  By  using  these  user  interfaces,  users  can  play  and  record 
audio  files,  compress  audio  files  to  reduce  the  requirement  for  the  network  bandwidth  and 
hard  disk  storage,  and  send  and  receive  audio  files  to  and  from  a  remote  computer  system 
with  different  audio  hardware. 

The  Protocol 

The  Internet  Voice  Mail  Protocol  (IVMP)  can  be  used  on  low-end  computers  with 
one  type  of  audio  hardware  (called  a  computer  with  single  audio-mode  or  single  mode 
computer)  as  well  as  on  high-end  workstations  with  several  types  of  audio  hardware  (a 
multi-mode  computer  system).  All  audio  formats  are  divided  into  two  categories:  standard 
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audio  formats  and  non-standard  formats. 

The  server  for  a  local  network  (LAN)  may  provide  voice  mail  service  for  several  types 
of  computer  systems  such  as  IBM  PCs  or  Macintoshes,  which  are  connected  to  this  local 
network.  This  server  can  be  considered  as  a  computer  with  multi-audio  modes. 

The  conversion  between  any  pair  of  standard  audio  formats  is  well  defined,  and  the 
conversion  between  any  pair  which  includes  one  or  two  non-standard  audio  formats  is 
experimental,  mainly  provided  by  programmers  who  use  such  new  audio  hardware,  new 
audio  file  format  or  new  compression  method. 

The  implementation  of  this  voice  mail  protocol  includes  a  database  of  standard  audio 
formats,  compression  methods,  and  a  library  of  conversion  functions  between  standard 
file  formats.  The  higher  the  version  of  the  implementation,  the  larger  the  database  and 
the  library.  The  local  programmers  are  required  to  configure  the  conversion  table  of  their 
computer  system.  The  entries  in  this  conversion  table  are  determined  by  the  configuration  of 
the  computer  system  and  the  types  of  audio  files  the  "server"  can  accept.  Local  programmers 
can  add  non-standard  conversion  relationships  into  this  conversion  table  as  well. 

The  lowest  version  of  the  implementation  of  this  protocol  is  defined  as  1.  Local 
programmers  can  define  the  version  of  the  "server"  as  zero  when  they  want  to  force  all 
clients  to  convert  audio  files  before  they  are  sent. 

This  voice  protocol  provides  two  services:  1)  the  client  sends  information  about  the 
queued  audio  files  to  the  remote  site  and  gets  information  regarding  audio  formats  the 
remote  computer  system  can  accept  before  the  transmission  of  audio  files  start;  2)  audio 
files  are  transmitted  through  Internet  network  if  these  audio  files  are  acceptable  by  the 
remote  site. 

The  negotiation  between  a  client  and  a  server  is  completed  by  the  exchange  of  control 
messages.  A  simple  algorithm  provides  basic  negotiation  capabilities  between  the  client  and 
the  server.  This  algorithm  can  be  considered  as  the  core  of  a  more  complicated  negotiation 
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process  which  may  add  more  criteria  in  order  to  set  different  priorities  to  different  types  of 
audio  files  or  different  clients. 

Eleven  control  messages  are  specified  in  the  protocol; 

#1  Connection  Required  Message 
#2  Connection  Response  Message 
#3  Standard  Information  Message 
#4  Standard  Information  Response  Message 
#5  Nonstandard  Information  Message 
#6  Nonstandard  Information  Response  Message 
#7  Disconnect  Message 
#8  Resend  Information  Message 
#9  Short  Connect  Require  Message 
#10  Renegotiation  Message 
#1 1  Multi-mode  Information  Message 

In  this  simple  negotiation  algorithm,  the  negotiation  processes  are  divided  into  5  modes 
based  on  the  parameters  of  the  audio  messages  and  the  acceptable  parameters  of  the  server. 
For  mode  I,  the  server  sends  its  parameter  to  the  client  and  later  converts  audio  files 
from  one  standard  format  to  another  standard  format  if  necessary,  or  determines  whether  a 
compressed  audio  file  should  be  sent.  Mode  II  is  similar  to  mode  I  but  the  client  and  the 
server  exchange  roles.  Mode  III  is  also  similar  to  mode  I  but  both  sites  support  non-standard 
formats.  In  modes  IV  and  V,  only  one  site  supports  a  given  standard  audio  format.  The 
client  supports  non-standard  formats  for  mode  IV,  and  the  server  supports  non-standard 
modes  for  the  mode  V.  In  modes  IV  and  V,  the  site  which  supports  the  non-standard  formats 
does  the  conversion  of  the  audio  files.  Mode  I  is  designed  for  most  audio  file  transfers  and 
the  rest  of  the  negotiation  modes  are  designed  for  special  cases. 
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IVMP  Implementation 

An  Internet  voice  mail  application  for  the  UNIX  operating  system  is  shown  in  Figure  4. 
This  system  is  designed  to  complete  the  following  operations:  play  and  record  audio  files; 
compress  audio  files;  and  send  and  receive  audio  files  through  the  Internet  to/from  a  remote 
site  with  different  audio  hardware.  This  system  consists  of  the  user  interface  (graphical 
and  non-graphical)  module,  the  data  compression  module,  the  audio  hardware  module,  the 
voice  mail  protocol  module,  and  the  database  module. 

A  versatile  tool  for  converting  between  various  audio  formats  is  Sound  Tools  [11]  (not 
to  be  confused  with  the  NeXT  library  of  the  same  name).  This  is  a  library  for  reading 
and  writing  various  types  of  audio  files  and  applying  special  effects  (e.g.,  echo,  or  channel 
averaging)  to  buffered  audio  data.  In  this  implementation,  the  conversions  of  audio  files 
between  current  audio  formats  are  completed  by  calling  the  Sound  Tools  Library. 

All  information  needed  in  the  user  interface  module,  data  compression  module,  and 
the  voice  mail  protocol  module  are  stored  in  the  database  module.  Database  files  in  the 
database  module  can  be  modified  through  the  user  interface  or  through  any  text  editor.  The 
voice  mail  protocol  provides  the  "client"  and  "server"  which  both  are  daemon  processes 
and  always  run  in  the  background.  The  parameters  of  an  outgoing  audio  file  are  sent  from 
either  the  graphic  user  interface  "xivmp-ui"  or  the  non-graphical  user  interface  "ivmp-ui" 
to  the  daemon  process  "client"  through  the  BSD  UNIX  socket  interface. 

The  voice  mail  protocol  is  designed  to  exchange  information  regarding  audio  file 
formats  and  audio  hardware  between  the  "client"  and  the  "server"  before  the  transmission 
starts.  Information  regarding  particular  audio  files  is  stored  in  a  special  database  file  when 
an  audio  file  is  created. 

The  graphical  user  interface  provides  a  friendly  way  to  control  this  voice  mail  ap¬ 
plication,  but  it  requires  many  resources.  The  non-graphical  user  interface  provides  an 
alternative  on  UNIX  systems.  It  is  designed  to  match  the  graphical  user  interface  in 
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Figure  4;  Voice  Mail  Application 
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functionality. 

In  the  voice  mail  protocol,  "client"  and  "server"  programs  are  implemented.  Each 
consists  of  three  parts:  a  connection  process  based  on  BSD  sockets;  a  negotiation  process 
based  on  control  messages  and  the  algorithms  mentioned  earlier;  and  a  file  sending  or  file 
receiving  process  based  on  the  TCP/IP  suite.  When  the  "client"  receives  a  set  of  parameters 
for  an  audio  file  from  a  user,  it  forks  off  a  child  process  to  handle  the  negotiation  process 
and  the  transmission  process  between  "client"  and  remote  "server".  After  successful 
negotiation,  the  "client"  will  signal  the  owner  of  this  audio  file  using  e-mail.  Similarly, 
the  "client"  will  signal  the  owner  of  errors  when  the  audio  file  cannot  be  sent.  Except  for 
the  case  of  a  failed  connection  to  the  "server",  the  "client"  will  delete  audio  file,  modify 
the  database,  and  terminate  the  child  process  after  signaling  success  or  error.  When  the 
"server"  accepts  a  connection,  it  forks  off  a  child  process  to  handle  the  connection.  The 
parent  process  continues  to  wait  for  more  connections.  After  completing  the  transmission, 
the  "server"  modifies  corresponding  database,  announces  the  acceptance  of  this  audio  file, 
and  terminate  the  child  process. 

The  compression  module  uses  the  STC  analyzer  to  code  the  data  in  the  STC  format, 
if  it  is  not  already  using  that  representation  (this  encoding  provides  a  4-to-l  compression 
over  standard  64  kb/s  /i-law  coding).  After  the  initial  compression,  the  data  is  stored  to 
disk.  The  storage  management  logic  then  checks  the  storage  space  remaining  available, 
and  performs  rate  transformations  as  required.  The  memory  management  control  logic  is 
shown  in  Figure  5.  This  simple  scheme  reduces  the  bit  rate  of  all  of  the  stored  messages 
when  a  specified  threshold  is  reached.  In  this  manner,  the  total  memory  usage  is  reduced, 
with  corresponding  quality  reductions  in  all  of  the  messages.  More  advanced  control 
mechanisms,  such  as  transformation  of  previously  checked  or  old  messages  before  new 
messages,  could  be  applied  if  desired. 

This  implementation  of  the  voice  mail  application  has  been  extensively  tested  on  Sun 
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Figure  5:  Message  Compression  Logic 


SPARCstation- 1+  and  SPARC  IPX  hosts,  and  Digital  Equipment  Corporation  DECStation- 
5000  workstations. 

Summary  of  Results 

This  research  has  extended  speech  coder  algorithms  and  produced  utilities  for  networked 
workstations  which  provide  increases  in  the  utility  of  those  coders.  The  modified  STC  coder 
and  documentation,  and  the  voice  mail  application  and  documentation,  will  be  supplied  to 
Rome  Laboratories  under  a  separate  cover. 

The  results  of  this  and  related  research  have  been  described  in  the  following  papers: 

J.  B.  Evans  and  T.  G.  Champion,  "Robust  Speech  Coding  and  Reconstruction 
Techniques",  1992  Jnt.  Conf  Signal  Processing  Appl.  &  Tech.,  Boston, 
Massachusetts,  Nov.  1992. 

B.  Yu  and  J.  B.  Evans,  "An  Internet  Voice  Mail  Protocol",  1992  Midwest  Elec. 

Tech.  Conf,  Ames,  Iowa,  March  1992. 
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Further  publications  based  on  this  work  are  expected. 
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Conclusion 


The  work  performed  under  this  program  has  explored  algorithms  for  improved  capabil¬ 
ity  for  speech  data  storage  and  transmission,  and  resulted  in  computer  network  applications 
demonstrating  those  algorithms. 

A  multi-stage  speech  compression  algorithm  using  parameter  space  transformations, 
which  was  developed  as  part  of  the  Air  Force  Office  of  Scientific  Research  Summer 
Faculty  Program,  was  investigated.  This  new  method  allows  speech  quality  to  be  balanced 
against  storage  requirements,  while  maintaining  high  quality  for  each  particular  rate.  This 
algorithm  provides  for  more  efficient  use  of  resources;  some  quality  can  be  sacrificed  as 
conditions  require,  while  normal  quality  can  be  maintained  when  resource  usage  is  light. 
In  this  project,  parameter  selections  for  a  medium  rate  (16  kb/s)  STC  were  studied.  The 
results  of  these  studies  suggest  that  additional  work  in  phase  coding  would  be  valuable 
for  obtaining  higher  quality.  This  project  also  investigated  new  techniques  for  reducing 
the  delay  of  the  STC  algorithms,  so  that  they  can  be  used  in  environments  where  that 
characteristic  is  important.  A  low  delay  FFT  window  was  evaluated  for  use  with  the  STC, 
and  was  found  to  provide  acceptable  speech  quality  with  some  modification  to  various  STC 
parameters. 

The  utility  of  the  parameter  transformation  technique  was  demonstrated  by  its  use  in 
an  example  voice  messaging  application.  This  application  runs  under  the  UNIX  operating 
system,  and  makes  use  of  the  TCP/IP  protocol  suite  for  reliable  transport  of  the  speech 
data.  Two  user  interfaces  were  developed,  one  based  on  an  X- Windows  graphical  interface 
and  the  other  based  on  standard  dumb  terminals.  A  novel  negotiation  and  transmission 
protocol  was  developed  for  this  application.  This  application  uses  the  STC  transformation 
technique  to  allow  dynamic  management  of  disk  resources  on  recipient  hosts.  Additional 
research  into  the  application  of  these  techniques  will  certainly  demonstrate  their  value  for 
easing  integrated  network  management  and  control  as  well  as  host  resource  management. 
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Abstract 


Most  strategies  for  fault  management  are  effective  only  for  a  narrow  range  of  fault 
classes.  In  survivable  distributed  systems,  a  wide  range  of  operating  environments  may  be 
encountered  that  require  different  strategies  to  be  used  at  different  times.  This  report 
discusses  how  adaptive  fault  management  can  be  used  to  select  the  most  appropriate 
methods  for  assuring  survivability  under  conditions  that  can  suddenly  and  drastically  change. 

There  are  three  basic  issues  in  this  research.  First,  a  taxonomy  for  fault  management 
mechanisms  must  be  established  that  allows  a  system  to  differentiate  among  possible 
choices.  Second,  a  characterization  of  operating  environments  must  be  developed  that  can 
be  used  by  a  system  to  dynamically  match  a  fault  management  mechanism  to  the  current 
environment.  Finally,  a  distributed  testbed  must  be  constructed  that  will  support  a  C^I 
application  and  the  collection  of  metrics  that  fit  into  the  models  constructed  for  the 
abovementioned  mechanisms  and  environments.  This  paper  describes  the  results  of  our 
investigation  into  these  three  issues  and  our  plans  for  future  work. 
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A  FRAMEWORK  FOR  ADAPTIVE  FAULT  MANAGEMENT 
IN  SURVIVABLE  C^I  SYSTEMS 

Rex  E.  Gantenbein 

INTRODUCTION 

Survivable  computing  systems  are  those  designed  to  continue  to  perform  adequately 
in  the  face  of  various  kinds  of  adversity  [Neumann92].  The  increased  dependency  of  the 
military  on  computer-based  command,  control,  communication,  and  intelligence  (C^I) 
systems  has  made  the  survivability  of  such  systems  an  important  research  problem.  A 
complex  system  that  can  justifiably  be  trusted  to  perform  its  duties  under  less-than-ideal 
conditions  must  include  some  form  oi  fault  management  to  allow  the  system  to  detect  and 
respond  to  failures  due  to  these  conditions. 

Unfortunately,  most  strategies  for  fault  management  consider  only  a  very  few  specific 
classes  of  faults  to  which  they  can  respond.  The  assumptions  about  a  system’s  operating 
environment  inherent  in  its  fault  management  strategy  cannot  be  violated  when  the  system 
is  deployed.  If  this  happens,  the  system  may  be  unable  to  cope  with  the  conditions 
encountered  and,  consequently,  be  in  danger  of  catastrophic  failure.  This  presents  a 
problem  for  many  military  systems,  as  operating  environments  can  rapidly  or  drastically 
change  under  combat  conditions. 

To  prevent  system  failure  due  to  a  rapidly  changing  environment,  a  system  can 
employ  adaptive  fault  management,  a  design  and  operation  paradigm  in  which  the  fault 
management  mechanisms  can  be  changed  or  modified  to  respond  to  variations  in  the 
system’s  behavior,  environment,  or  requirements.  Unlike  static  fault  management,  which 
typically  uses  a  single  mechanism  to  respond  to  faults  in  the  system,  adaptive  fault 
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management  allows  the  system  to  bring  into  play  any  of  a  number  of  alternative 
mechanisms.  By  adapting  to  changing  conditions,  the  system  can  continue  to  perform 
adequately  in  a  variety  of  environments,  including  significant  loss  of  processing,  data,  or 
communication  resources. 

Over  the  past  year,  researchers  from  the  University  of  Wyoming  Department  of 
Computer  Science  have  attempted  to  establish  a  foundation  for  the  study  of  adaptive  fault 
management  as  a  method  for  achieving  survivability  in  distributed  C^I  systems.  Our  goal 
has  been  to  define  the  major  issues  in  this  approach  and  to  propose  some  solutions.  In  this 
report,  we  present  the  results  from  our  research  so  far. 

RESEARCH  ISSUES  IN  ADAPTIVE  FAULT  MANAGEMENT 

We  present  our  research  in  the  context  of  distributed  computing  systems,  since  many 
C^I  applications  are  designed  to  be  distributed  over  several  processing  nodes,  geographically 
dispersed  and  loosely  coupled  by  communication  links.  Each  node  may  contain  resources 
for  processing,  storing,  and  transmitting  information,  although  not  all  resources  need  be 
present  at  every  node.  Operations  to  access  instances  of  each  of  these  resource  classes  are 
provided  by  server  processes  on  behalf  of  (potentially  remote)  clients,  which  exchange 
messages  over  the  communication  system  to  request  and  carry  out  resource  accesses. 

We  also  assume  that  the  system  is  dedicated  to  a  single,  multi-user  application 
composed  of  a  number  of  cooperating  client  processes,  which  may  communicate  among 
themselves  as  well  as  with  servers.  This  application  may  be  partially  layered,  so  that  one 
component  of  the  application  provides  a  higher-level  service  interface  to  another.  In  this 
model,  an  error  occurs  when  the  application  (or  any  component)  is  unable  to  meet  its 
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requirements  with  respect  to  a  service,  and  failure  is  the  consequence  of  an  error 
manifesting  itself  in  a  server’s  inability  to  deliver  its  required  service  to  one  or  more  of  its 
users.  Survival,  in  this  case,  thus  means  the  avoidance  of  failure  in  critical  services  at  the 
highest  (end-user)  layer  of  the  application. 

The  success  of  an  adaptive  approach  to  fault  management  is  dependent  on  the  system 
being  able  to  dynamically  determine  how  it  will  attempt  to  avoid  failure.  Our  approach  uses 
an  adaptation  data  base  containing  information  about  the  mechanisms  that  may  be 
employed  for  this  purpose  [Gantenbein92].  This  information  consists  of; 

•  a  rule  base  that  supports  decisions  about  the  appropriateness  of  each  available 
strategy  under  existing  internal  and  external  conditions,  and 

•  a  data  base  that  contains  characterizations  of  the  mechanisms  that  can  be  used  to 
implement  these  strategies. 

During  operation,  the  system  continuously  collects  information  about  the  current  external 
and  internal  conditions  to  interpret  its  environment.  An  adaptive  behavior  manager  in  the 
system  uses  the  rule  base  to  relate  this  information  to  the  data  base  of  mechanisms  and  to 
select  the  fault  management  mechanism  that  can  best  maintain  "adequate"  behavior  under 
adverse  conditions. 

There  are  a  number  of  research  tracks  inherent  in  an  adaptive  approach  to  avoiding 
failure.  Our  work  has  concentrated  on  the  following  three  tracks: 

•  characterizing  mechanisms  for  fault  management  to  differentiate  them  for  adaptive 
decision  making; 

•  defining  system  requirements  so  that  the  system  can  efficiently  determine  whether 
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it  is  performing  "adequately"  under  a  variety  of  adverse  conditions;  and 
•  selecting  a  testbed  that  can  support  both  development  of  reliable  distributed 
applications  and  collection  of  run-time  data  that  can  be  used  in  making  decisions 
about  the  need  for  change  in  the  fault  management  mechanism. 

These  three  research  areas  are  fundamental  to  the  adaptive  fault  management  approach  to 
survivable  systems.  Our  work  has  concentrated  on  developing  models  that  can  be  used  to 
evaluate  proposed  solutions  for  each  track.  We  present  the  results  of  our  work  in  the  next 
section. 

RESEARCH  RESULTS 

The  University  of  Wyoming’s  involvement  in  adaptive  fault  management  began  with 
the  author’s  participation  in  the  1991  AFOSR  Summer  Faculty  Research  Program  at  Rome 
Laboratory.  Here,  collaboration  with  members  of  the  Computer  Systems  branch  of  the 
Command,  Control,  and  Communication  Division  led  to  a  successful  proposal  to  the 
AFOSR  Research  Initiation  Program  for  1992.  Additional  support  for  this  work  came  from 
the  University  of  Wyoming  Planetary  and  Space  Science  Center  and  from  a  second  year  of 
participation  in  the  Summer  Faculty  Research  Program  during  1992. 

Characterizing  fault  management  mechanisms 

The  four  basic  phases  of  fault  management  are  error  detection,  damage  assessment, 
damage  confinement,  and  response.  Most  previous  classification  schemes  for  fault 
management  mechanisms  have  grouped  the  mechanisms  according  to  the  type  or  amount 
of  hardware  or  software  redundancy  required  and  the  phase  at  which  the  redundancy  is 
applied. 
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Since  we  are  more  interested  in  the  operating  conditions  in  which  a  given  mechanism 
is  most  effective,  we  instead  define  three  general  fault  management  strategies  of  which  a 
mechanism  may  be  an  instance  [Gantenbein91]: 

•  optimistic  strategies  are  those  used  when  resources  (including  time)  are  sufficiently 
plentiful  to  allow  explicit  detection  and  recovery  from  errors  that  occur  without 
significant  effects  on  the  other  requirements  of  the  system; 

•  pessimistic  strategies  are  used  when  the  number,  type,  or  frequency  of  errors  requires 
the  additional  allocation  of  system  resources  to  fault  tolerance  in  order  to  prevent 
errors  from  occurring  and  causing  loss  of  critical  services;  and 

•  ultrapessimistic  strategies  are  used  only  when  conditions  have  degraded  to  the  point 
where  too  few  resources  are  available  to  maintain  all  critical  services  as  well  as  the 
fault  tolerance  mechanisms. 

These  categories  serve  as  the  basis  of  a  taxonomy  for  fault  management  techniques.  By 
classifying  a  technique  into  one  of  these  three  categories,  we  can  effectively  determine 
whether  it  matches  a  particular  set  of  operating  conditions.  We  can  broadly  apply  these 
categories  to  the  three  basic  resource  classes  in  a  distributed  system  [KohlerSl]:  processing 
(generating  or  transforming  information),  data  (storing  information),  and  communication 
(transmitting  information).  We  classify  individual  techniques  as  optimistic,  pessimistic,  or 
ultrapessimistic  depending  on  the  resources  needed  to  detect,  confine,  assess,  and  respond 
to  errors  in  the  system. 

Optimistic  processing  techniques  depend  on  explicit  error  detection,  which  requires 
a  significant  amount  of  a  priori  information,  both  about  the  application  and  the  errors  that 
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can  occur  in  its  operation.  Response  to  a  detected  error  typically  involves  restoration  of  an 
error-free  state  and  an  attempt  to  repeat  the  operation  or  invoke  an  alternate  component. 
System  services  are  suspended  during  the  response,  but  little  information  about  the 
application  or  error  is  needed  to  carry  it  out. 

Pessimistic  strategies  typically  require  replication  of  a  server  or  resource,  both  to 
detect  errors  and  to  provide  redundant  information  that  allows  an  error  to  be  masked.  This 
involves  significant  resource  allocation  to  fault  management.  Compared  to  the  optimistic 
techniques,  either  new  allocations  will  need  to  be  made  or  some  degradation  in  other 
services  will  occur.  Since  errors  are  not  explicitly  detected  and  an  error-free  state  is  created 
dynamically  by  the  replicated  resources,  very  little  a  priori  information  is  required  for  either 
error  detection  or  response. 

The  resource  overhead  of  both  optimistic  and  pessimistic  strategies  allow  a  system 
to  continue  to  perform  adequately  under  adverse  conditions,  as  long  as  sufficient  resources 
exist.  However,  in  some  situations,  the  resources  may  not  be  available  for  either  of  these 
approaches  to  be  effective.  If  the  system  is  to  survive,  detection  and  response  must  be 
carried  out  using  information  about  the  application  and  the  error  to  set  the  system  state  to 
a  default  or  approximated  error-free  state  so  that  the  system  can  continue  to  provide  at  least 
some  portion  of  its  expected  service.  Unfortunately,  an  approximation  or  default  is  likely 
to  cause  the  system  to  violate  its  specification,  at  least  temporarily.  Therefore,  the 
expectations  on  the  service  must  be  relaxed  to  avoid  the  degraded  service  delivery  from 
being  detected  as  an  "error"  and  causing  further  degradation.  This  requires  significant  a 
priori  knowledge  about  the  application. 
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Most  well-known  fault  management  techniques  fall  into  one  of  these  three  strategy 
classes.  Optimistic  and  pessimistic  techniques  have  been  widely  used  to  detect  and  recover 
from  and  mask  errors  in  both  hardware  and  software.  Ultrapessimistic  techniques  are  less 
widely  used,  probably  because  of  their  application-specific  nature.  In  Figure  1  below,  we 
present  a  number  of  fault  management  techniques,  taken  from  [Armstrong91],  as  examples 
of  these  general  strategies  for  the  processing,  data,  and  communication  resource  classes. 


Classification  Characteristics 

Processing 

Data  Storage 

Communication 

1 

Optimistic: 

devotes  resources  primarily 
to  computation;  needs  a 
priori  information  for  error 
detection;  recovers  state  by 
recomputation 

Standby  ver¬ 
sions: 

Cold  hardware 
sparing, 

recovery  blocks 

transactions, 

checkpointing 

retransmission 

2 

Pessimistic: 

devotes  resources  largely  to 
error  detection  and 

response;  needs  little  a  priori 
information  for  either  task; 
recovers  state  by  masking 

Multiple  ver¬ 
sions: 

N-modular 
redundancy, 
N-version  pro¬ 
gramming, 
hybrids 

replicated  data, 
error-correcting 
codes 

error-correcting 
codes,  multi¬ 
casting,  redun¬ 
dant  media 

3 

Ultrapessimistic: 
devotes  resources  to  critical 
services;  relaxes  system 
requirements;  requires  a 
priori  information  for  both 
error  detection  and 

response;  recovers  state  by 
approximation 

cold  sparing 

without  check¬ 
points 

compensating 

transactions 

network  parti¬ 
tioning 

Figure  1.  Examples  of  fault  management  strategies. 


Monitoring  the  operating  environment 

The  identification  of  these  three  strategies  for  fault  management  allows  us  to 
differentiate  among  a  number  of  mechanims  and  determine  which  are  most  effective  in  a 
given  operating  environment.  Howver,  the  need  for  dynamic  selection  of  a  fault 
management  mechanism  is  created  by  a  change  in  operating  conditions,  which  must 
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somehow  be  signaled  to  the  adaptive  behavior  manager.  Thus,  the  system  must  be  able  to 
monitor  the  operating  environment  and  determine  whether  a  change  has  taken  place  that 
indicates  the  need  for  change  in  the  fault  management  strategy. 

Monitoring  operating  conditions  at  run  time  requires  an  efficient,  quantifiable  metric 
that  can  measure  changes  in  the  environment  and,  more  importantly,  their  effect  on  a 
system’s  ability  to  deliver  its  services.  This  metric,  the  objective  function,  is  a  partial 
specification  of  a  service  that  uses  derived  or  observed  attributes  representing  the 
environment  to  evaluate  the  delivery  of  the  service  in  that  environment  [Gantenbein91]. 

This  function  should  express  all  of  the  objectives  that  a  system  service  must  meet  to 
be  survivable.  Objectives  may  include  such  things  as: 

•  functionality,  the  extent  of  the  delivery  or  accessibility  of  the  expected  service; 

•  performance,  a  measure  of  the  timeliness  of  the  delivery  of  the  service; 

•  consistency,  the  agreement  among  a  set  of  servers  (or  among  successive  invocations 

by  a  single  server)  on  the  contents  or  attributes  of  a  service; 

•  precision,  the  latitude  that  will  be  accepted  in  the  delivery  of  a  service; 

•  security,  a  measure  of  the  service’s  resilience  to  deliberately  introduced  faults;  and 

•  safety,  the  risks  associated  with  delivery  of  the  service. 

Objectives  such  as  these  can  be  defined  as  an  expected  region  for  the  value  of  the  objective 
function,  which  may  be  a  single  function  or  a  composition.  The  value  of  the  function  then 
measures  how  closely  the  delivery  of  the  service  matches  its  expectations,  and  can  thus  be 
used  by  the  adaptive  behavior  manager  to  decide  when  a  new  strategy  is  needed  and  which 
strategy  to  apply  next. 
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To  illustrate,  we  can  define  an  objective  function  to  measure  the  number  of  active 
replications  of  a  process  in  a  distributed  system  of  m  nodes  with  n  <  m  tasks  that  must  be 
executed  on  some  subset  of  those  nodes.  Let  [X^j]  be  the  n  by  m  matrix  of  zeros  and  ones, 
where  X^j  =  1  if  task  i  is  scheduled  on  node  j  and  0  otherwise.  The  number  of  processors 
to  which  task  i  is  to  be  assigned  is  the  replication  factor  r,  of  the  task.  We  can  evaluate 
whether  the  scheduler  has  met  the  requirement  that  each  task  be  assigned  to  0  <  r,  <  m 
nodes  using  the  function 

m 

in  which  [X^j]  is  updated  by  the  scheduler  and  i  and  m  are  system  design  factors.  With  this 
function,  the  adaptive  behavior  manager  can  determine  the  replication  of  any  task  in  the 
system  and  thus  evaluate  the  effectiveness  of  the  current  scheduling  algorithm. 

To  carry  out  this  kind  of  analysis  for  all  the  dependability  attributes  of  a  distributed 
C^I  system,  we  must  define  the  kinds  of  environmental  changes  to  be  considered  in  each  of 
the  resource  classes  identified  in  the  previous  section.  From  this,  we  can  develop  metrics 
that  will  allow  us  to  monitor  the  changes  and  determine  when  a  change  in  the  fault 
management  strategy  is  necessary. 

According  to  our  model,  change  in  the  fault  management  strategy  is  necessary  when 
the  strategy’s  assumptions  about  the  operating  conditions  do  not  match  the  current 
conditions.  Methods  for  detecting  such  a  situation  include  monitoring  the  conditions  for 
such  things  as  resource  availability,  error  rates,  communication  delays,  etc.,  and  match  the 
conditions  against  the  rule  base  to  determine  which  mechanism  would  be  most  appropriate 
for  the  conditions.  In  this  way,  the  system  can  predict  problems  in  the  system  before  they 
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occur. 


However,  the  objective  function  also  allows  us  to  explicitly  detect  errors  in  the 
system.  If  the  computed  value  of  an  objective  function  does  not  fall  into  the  expected 
region,  then  we  say  that  an  anomaly  has  occurred  in  that  objective.  An  anomaly  is, 
obviously,  the  internal  manifestation  of  an  error  in  the  system,  so  we  can  model  errors  in 
service  delivery  in  terms  of  anomalies  in  the  objective  for  the  service.  The  advantage  of  this 
model  is  that,  since  we  have  a  region  for  objective  values  for  any  service,  we  allow  the 
system  some  tolerance  in  meeting  it.  Furthermore,  tradeoffs  between  objectives  can  be 
specified  in  this  model  by  policies  such  as  maximizing  the  number  of  objective  functions  in 
their  expected  regions  or  minimizing  the  likelihood  that  an  objective  will  not  be  met. 

In  order  for  this  kind  of  adaptivity  to  be  effective,  it  is  necessary  that,  first,  the  system 
designer  be  able  to  specify  the  boundaries  of  adequate  behavior  for  a  given  environment. 
The  definition  of  these  boundaries  comes  from  the  system  requirements,  which  specify  the 
expected  behavior  of  a  server  and  can  thus  be  used  to  evaluate  the  server’s  effectiveness  in 
the  current  environment.  These  boundary  values  [Kim90]  are  used  by  the  objective  function 
to  indicate  that  a  change  in  the  environment  has  violated  the  assumptions  of  the  current 
fault  management  strategy. 

Matching  the  regions  of  adequate  behavior  to  an  appropriate  fault  management 
mechanism  is  a  complex  problem  for  a  nontrivial  system.  One  approach  to  doing  this  is  to 
define  the  regions  as  failure  classes  that  are  likely  to  occur  in  the  system  (independent  of 
the  cause  of  the  failure),  and  then  link  each  failure  class  to  a  given  mechanism  that  can 
prevent  it.  Any  detected  anomaly  in  the  system  would  be  mapped  into  a  failure  class,  which 
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in  turn  would  suggest  the  mechanism  to  recover  t'rom  it.  This  level  of  granularity  of  error 
recovery  presumes  an  efficient  dynamic  selection  algorithm,  which  could  be  based  upon  an 
exception  handling  approach  similar  to  that  proposed  in  [Liu92]. 

Identification  of  the  failure  classes  and  the  mapping  to  a  given  mechanism  could  be 
done  using  fault  tree  analysis.  A  fault  tree  illustrates  the  logical  interrelationships  of  basic 
events  that  lead  to  an  undesired  event  at  the  root  of  the  tree  [Hansen89].  This  approach 
can  be  used  to  define  the  environment  conditions  that  would  lead  to  a  failure.  The  design- 
phase  characterization  of  the  environment  in  terms  of  its  failure  classes,  coupled  with  the 
run-time  collection  of  information  about  the  environment,  gives  an  implementation  of  the 
objective  function  that  can  be  used  to  handle  both  anticipated  (explicitly  detected)  and 
unanticipated  (predicted  by  mismatches  between  the  environment  and  the  mechanisms) 
errors. 

Valuable  insight  into  this  problem  may  come  from  a  proposed  technique  for 
automatic  error  recognition  [Iyer90].  The  methodology  for  this  technique  consists  of: 

(1)  recognition  of  errors  among  which  relationships  can  exist; 

(2)  confirmation  of  the  relationships  among  the  errors;  and 

(3)  quantifying  the  strength  of  those  relationships. 

The  rate  at  which  system  errors  occur  identifies  the  error  states  among  which  relationships 
occur  by  the  identification  of  similar  error  patterns.  Frequent  instances  of  errors  within  a 
short  time  period  describes  the  same  error  condition  or,  at  the  least,  highly  related  errors. 
These  error  groups  can  be  used  to  identify  the  symptoms  of  persistent  problems  and  the 
times  of  their  occurrence.  This  approach  can  be  used  for  on-line  error  detection. 
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In  the  adaptive  fault  management  model,  such  an  approach  can  be  used  to 
differentiate  persistent  from  transient  errors  and  to  separate  detected  errors  from  each 
other.  However,  there  are  some  inherent  drawbacks  in  this  method.  Repeated  occurrences 
of  errors  can  appear  quite  different,  since  the  same  fault  could  affect  different  components 
and  thus  cause  the  system  to  classify  the  error  in  different  groups.  Also,  the  method 
depends  heavily  on  statistical  techniques,  which  could  be  a  disadvantage  in  defining  a  broad 
framework.  Finally,  the  method  here  is  directed  largely  at  hardware,  rather  than  software 
errors,  which  may  not  be  so  easily  analyzed.  More  consideration  is  needed. 

The  task  of  creating  such  a  set  of  metrics  for  the  services  in  a  complex  system  is 
difficult.  The  dependency  between  the  properties  of  a  metric  and  its  usage  are  very 
important,  and  so  the  properties  of  the  metric  must  be  examined  and  tested  as  to  what  they 
measure  [Kearney86].  Validation  is  also  an  issue;  nonparametric  statistical  methods  (such 
as  the  use  of  contingency  tables)  appear  to  have  an  important  role  in  evaluating  metrics 
against  fixed  validity  criteria.  We  are  exploring  methods  for  validating  objective  functions 
as  a  measure  of  a  fault  management  mechanism’s  effectiveness,  as  well  as  for  error 
detection.  Among  the  criteria  that  being  considered  are  association,  consistency, 
discriminative  power,  tracking,  predictability,  and  repeatability  [Schneidewind92]. 
Choosing  a  testbed  for  adaptive  system  evaluation 

The  issues  to  be  considered  in  using  an  adaptation  paradigm  for  constructing 
survivable  systems  include  the  system  structure,  the  operational  profile,  the  operating 
environment,  and  the  status  of  the  system  once  a  failure  (actual  or  potential)  has  been 
discovered.  Abstract  analysis  of  these  factors  is  an  extremely  difficult  problem,  particularly 
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in  a  real-time  environment,  where  the  survivability  requirements  may  conflict  with  the  timing 
constraints  [Shin91].  We  therefore  intend  to  develop  a  prototype  survivable  C^I  system  in 
order  to  address  those  issues  most  important  to  the  construction  of  these  systems  in  a 
distributed  real-time  environment.  The  value  of  such  a  testbed  approach,  where  validation 
of  proposed  techniques  is  based  on  their  experimental  application  and  the  measurement  of 
the  results,  has  been  shown  in  several  research  projects  involving  reliable  real-time  systems 
(e.g.,  [Kim89]  and  [Wedde89]). 

The  first  step  in  constructing  this  prototype  is  the  selection  of  a  development 
platform  that  can  support  dependable  distributed  applications  that  exhibit  the  features  of 
a  C^I  system.  A  number  of  different  platforms  are  available,  and  the  features  offered  vary. 
Based  on  our  intent  to  use  this  prototype  as  a  feasibility  study  as  well  an  experimental 
evaluation  testbed,  we  have  defined  a  set  of  six  criteria  for  the  testbed  platform: 

•  The  software  system  should  allow  applications  to  execute  in  native  mode  on  the 
platform.  No  interpreter  or  secondary  system  should  be  necessary.  Any  secondary 
system  would  influence  the  experimental  data  we  intend  to  collect  from  the 
prototype. 

•  The  system  should  support  data  abstraction  and  information  hiding  in  a  well-defined 
form  such  as  objects  or  modules.  These  abstractions  should  exhibit  location 
transparency,  migration,  and  replication. 

•  The  system  should  support  transactions  or  some  other  form  of  atomic  action. 

•  The  system  should  either  be  fault  tolerant  itself  or  provide  tools  for  building  fault- 
tolerant  applications.  Automatic  recovery  is  desirable,  although  the  prototype  should 


4-15 


be  allowed  to  make  its  own  decisions  about  what  components  to  make  fault  tolerant 
and  which  method  is  used. 

Graceful  degradation  should  be  possible,  as  should  noninterrupting  repair  or 
replacement  of  objects  in  the  system. 

The  system  should  support  real-time  processing  to  the  extent  that  hard  and  soft 
deadlines  can  be  set  and  alarms  raised  if  the  deadlines  are  missed.  High 
performance  is  not  a  direct  issue,  but  the  ability  to  specify  a  level  of  performance  in 
terms  of  deadlines  is  essential. 
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Figure  2.  Results  of  platform  evaluation. 

We  have  used  these  criteria  to  evaluate  a  total  of  17  different  platforms  in  this  project.  The 
results  are  tabulated  in  Figure  2.  Based  on  this  evaluation,  we  chose  three  systems  for 
further  study. 
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ISIS  [Birman91]  is  a  distributed  systems  development  toolkit  that  extends  any  of  a 
number  of  UNIX  dialects.  While  not  directly  supporting  objects,  a  distributed  object  known 
as  a  group  can  be  defined  that  can  support  replication,  migration,  and  fault-tolerant 
communication.  ISIS  also  includes  a  package  called  META  that  allows  creation  of  soft  real¬ 
time  applications  [Marzullo91].  META  can  also  be  used  as  a  monitoring  tool  for 
performance  and  failure  in  an  application.  Upon  failure,  a  process  can  be  restarted  using 
information  kept  by  a  log  manager. 

Chaos  [Schwan90]  is  an  operating  system  designed  for  development  of  large,  complex 
real-time  applications.  Chaos  supports,  and  is  composed  of,  objects,  which  can  contain 
multiple  processes  distributed  over  several  nodes.  The  VGA  feature  of  Chaos  adapts  an 
executing  real-time  application  by  dynamically  moving  it  along  a  set  of  programmer-specified 
versions  of  the  application,  with  each  version  having  different  functionality  and  performance. 
The  goal  of  the  Chaos  system  is  to  achieve  the  programmer-specified  level  of  performance 
at  all  costs.  Its  fault-tolerant  support  is  limited. 

Clouds  [Dasgupta91]  is  another  distributed  operating  system  that  is  object-based. 
The  compilers  in  the  system  create  and  distribute  objects  to  data  servers,  which  are 
responsible  for  the  storage  of  objects.  The  same  object  can  be  concurrently  invoked  by 
threads  at  different  compute  servers,  supporting  replication  and  consistency.  No  mention 
of  a  migration  facility  was  made  in  the  source  material.  Atomicity  is  achieved,  however,  by 
specialty  threads  defined  by  a  locking  and  recovery  subsystem.  Support  for  graceful 
degradation  is  provided  for  multiple  objects  and  threads,  as  is  transparent  replacement. 
There  does  not  appear  to  be  strong  support  for  real-time  processing. 
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As  shown  in  Figure  2  above,  each  of  these  three  systems  has  some  advantages  and 
disadvantages  in  terms  of  the  criteria  for  an  evaluation  platform.  Among  these  three 
candidates,  however,  ISIS  appeared  to  have  most  of  the  features  we  required,  particularly 
the  support  of  fault  tolerance.  Since  ISIS  is  a  toolkit  rather  than  a  native-mode  operating 
system,  ISIS  applications  run  directly  atop  UNIX,  using  the  ISIS  facilities  mostly  for 
communication  and  coordination.  We  were  also  impressed  by  the  stated  capabilities  of 
META  for  run-time  monitoring  of  both  the  environment  (as  required  by  the  objective 
function)  and  system  attributes  (as  part  of  the  evaluation  of  different  techniques  for  fault 
management.) 

In  addition,  we  wanted  the  system  to  be  compatible  with  systems  available  at  Rome 
Laboratory,  so  that  our  experiments  can  be  ported  and  demonstrated  there.  The  Air  Force 
has  shown  interest  in  ISIS  as  one  of  the  platforms  for  its  survivable  system  experiments,  and 
the  commercial  version  of  the  system  is  available  to  academic  institutions  at  a  significant 
discount.  We  therefore  selected  ISIS  as  the  platform  for  development  of  the  testbed. 
PROJECT  STATUS 

Additional  details  of  the  work  described  in  this  report  are  documented  in  three 
papers  that  are  currently  under  review: 

•  a  description  of  the  taxonomy  for  fault  management  strategies  [Gantenbein93a], 

•  a  summary  of  the  use  of  the  objective  function  as  a  mechanism  for  relating  system 
requirements  to  system  behavior  [Gantenbein93b];  and 

•  an  overview  of  the  adaptive  fault  management  approach  to  survival  in  distributed 
systems  as  an  "innovative  idea"  report  [Gantenbein93c]. 
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An  fourth  paper  is  planned  for  submission  to  the  next  Symposium  on  Reliable  Distributed 
Systems  (SRDS)  early  in  1993. 

The  SRDS  and  the  annual  International  Symposium  on  Fault-Tolerant  Computing 
(FTCS)  are  very  highly  regarded  conferences  for  researchers  working  in  dependable 
computing.  The  principal  investigator  attended  both  of  these  conferences  in  1992  with  the 
support  of  the  AFOSR  Research  Initiation  Program  grant.  The  contacts  with  others  working 
in  the  field  was  very  valuable;  in  particular,  discussions  with  two  individuals  involved  with 
the  adaptive  fault  tolerance  project  ~  Dr.  K.H.  (Kane)  Kim  of  the  University  of  California 
at  Irvine  and  Dr.  Lawrence  Alexander  of  GE  Aerospace  —  provided  motivation  and  insight 
for  our  work  and  opened  avenues  for  further  coordination  and  collaboration. 

At  this  time,  our  work  continues  along  the  research  tracks  already  identified.  With 
internal  support  from  the  University  of  Wyoming  through  May  1993,  we  intend  to  complete 
the  population  of  the  taxonomy  and  refine  the  categories  for  better  separation  among  the 
mechanisms.  We  also  plan  to  formalize  the  definition  of  the  objective  function  and  extract 
the  most  important  features  of  our  current  model  for  evaluation  in  a  prototype  distributed 
application.  We  have  acquired  version  2.2  of  ISIS  and  will  soon  install  it  on  20  nodes 
through  the  Wyoming  campus  network.  This  configuration  will  be  used  as  the  platform  for 
a  prototype  distributed  application,  which  will  simulate  hostile  and  friendly  targets  entering 
and  leaving  the  range  of  a  set  of  radar  stations.  The  system  will  be  able  to  simulate  both 
mission  changes  and  resource  loss  to  allow  us  to  evaluate  different  strategies  for  survival  and 
demonstrate  the  feasibility  of  the  adaptation  mechanism  we  have  proposed. 
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CONCLUSIONS  AND  FUTURE  RESEARCH 

We  have  in  this  project  established  a  solid  framework  for  the  further  study  of 
adaptive  fault  management  as  an  approach  to  building  survivable  C^I  systems.  Our  next 
goal  is  to  develop  the  characterizations  of  fault  management  mechanisms  and  operating 
conditions  to  the  extent  that  the  mappings  between  them  can  be  quantified  and 
implemented  using  META  functions  in  our  prototype  application.  Once  these  metrics  are 
included  in  the  prototype,  we  will  be  able  to  simulate  various  scenarios  in  the  testbed  and 
measure  the  prototype’s  ability  to  adapt  to  and  survive  rapid  and  drastic  changes  in  the 
operating  environment. 

Our  research,  in  combination  with  other  work  sponsored  by  Rome  Laboratory,  forms 
a  cohesive  effort  in  the  area  of  adaptive,  survivable  systems.  The  relationships  among  these 
projects  can  be  described  by  the  research  schema  shown  in  Figure  3.  In  the  short  term,  our 
work  is  helping  build  a  general  framework  for  the  application  of  adaptive  fault  management 
to  survivable  C^I  systems.  Development  of  the  models  for  the  environments  and  fault 
management  mechanisms  will  lead  to  evaluation  metrics  for  these  systems,  while  the  testbed 
study  will  identify  ISIS  features  for  adaptivity  and  survival  that  will  be  used  by  Rome 
Laboratory  and  its  subcontractors,  GE  Aerospace  and  SRI  Associates,  in  developing 
demonstration  systems. 

In  the  long  term,  we  see  this  research  as  leading  towards  a  generalized  methodology 
for  the  dynamic  redefinition  of  system  behavior  with  respect  to  an  operating  environment 
that  may  change  over  time.  Not  only  fault  management,  but  other  system  components  (such 
as  task  scheduling  or  resource  management)  become  less  effective  when  the  assumptions 
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Figure  3.  An  adaptive  fault  management  research  schema, 
inherent  to  them  are  violated.  Eventually,  we  foresee  a  formal  design  paradigm  for  adaptive 
systems,  supported  by  this  methodology.  With  the  increased  reliance  to  be  placed  on 
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computer-driven  systems  in  the  future,  we  believe  that  this  approach  can  make  these  systems 
increasingly  dependable. 
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1  INTRODUCTION 


We  tested  optical  waveguides  in  glass  substrates.  These  waveguides 
will  be  used  in  ultra  high  speed  network  pulse  multiplexing  optical  cir¬ 
cuits  and  as  Coupled  Waveguide  Light  Amplifiers  (CWLA)  and  waveguide 
lasers.  They  can  also  be  used  as  External  Optical  Modulators  (EOMs).  The 
coupled  laser  can  be  controlled  by  applying  bias  voltages  across  the  lasing 
waveguides.  Both  free  space  and  fiber  optic  communication  channels  have 
wider  bandwidth  of  the  order  of  10^^  HZ,  than  any  other  communication 
medium.  At  present,  the  bandwidth  of  optical  communication  channels  is 
limit  by  the  electronic  devices  employed  at  the  input  and  output  of  the 
channels.  Current  electronic  devices  are  limited  to  a  bandwidth  of  about 
50  GHz  (5  X  10^°  Hz).  This  is  a  small  fraction  of  the  available  optical 
bandwidth.  These  devices,  in  conjunction  with  passive  optical  waveguides, 
can  be  used  as  all  optical  multiplexers  and  demultiplexers  that  would 
convert  multi  channel  parallel  "slow”  electronic  input  signals  to  high 
speed  serial  optical  signals  and  and  high  speed  optical  signals  to  parallel 
multi  channel  “slow”  electronic  signals.  As  stated  above,  the  coupled 
laser  can  be  controlled  by  applying  bias  voltages  across  the  lasing 
waveguides.  The  application  of  a  voltage  across  one  lasing  waveguide  will 
shift  the  frequency  of  the  light  from  the  particular  waveguide  with  re¬ 
spect  to  the  light  from  the  other  waveguide.  The  difference  frequency  of 
the  light  beams  from  the  two  waveguides  would  be  in  the  mm-wave  fre¬ 
quency  or  micro-wave  frequency  range.  The  electrical  bias  allows  contin¬ 
uous  control  of  the  difference  frequency.  Such  light  sources  have  applica¬ 
tions  in  Optical  Phased  Array  Antenna  Signal  Processors  (OPAASPs). 

We  also  tested  Tunneling  Devices  for  Optical  AND  Gate  applications. 
These  type  of  gates  are  used  in  Ultra  High  Speed  Fiber  Optical  Networks. 
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1  OPTICAL  WAVEGUIDES  IN  GLASS  SUBSTRATES 


a)  Planar  and  Ridge  Waveguides 

We  fabricated  two  types  of  waveguides.  We  fabricated  both  planar  and 
ridge  waveguides.  Planar  waveguides  are  constructed  by  doping  channels 
in  a  glass  substrate  just  below  the  surface  as  shown  in  Fig.  1.  Ridge 
waveguides  have  channels  of  doped  glass  that  protrude  above  the  sub¬ 
strate  surface.  Planar  waveguides  can  only  guide  light  around  large  radii, 
typically  about  10  cm  in  radius.  Ridge  waveguides  can  guide  light  around 
substantially  smaller  radii  of  only  a  few  millimeters.  The  light  input  and 
output  to  these  waveguides  is  via  single  mode  fibers.  Indeed  all  tests  to 
date  on  the  waveguides  were  performed  using  light  coupled  into  the 
waveguides  through  a  single  mode  fiber. 


Waveguide 


Fig.  1.  Planar  waveguide, 
b)  Waveguide  Application 

These  waveguides  have  many  applications.  One  of  which  is  in  pulse 
multiplexing  and  demultiplexing  networks.  A  multiplexing  network 
consists  of  a  single  input  waveguide  that  branches  into  a  number  of 
waveguides.  Each  waveguide  paths  has  a  different  optical  lengths  before 
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the  path  are  recombined.  The  differences  in  the  optical  path  length  are 
integer  multiples  of  the  shortest  path  length  difference.  Consider  a  pulsed 
optical  signal  with  a  repetition  rate  longer  than  the  longest  difference  in 
optical  delay.  Each  path  delays  the  pulse  by  a  different  time  interval.  When 
the  delayed  pulses  are  recombined  a  signal  with  a  much  higher  pulse 
repetition  rate  results.  The  realization  using  a  single  glass  substrate 
results  in  the  construction  of  precise  delays.  Also  since  all  the  delay 
elements  are  located  on  the  same  substrate  they  expand  and  contract  with 
temperature  in  unison. 


Waveguide 


Fig.  2.  Ridge  waveguide. 

However,  to  make  such  a  device  useful  the  pulses  have  to  be  not  only 
delayed  but  also  modulated.  Therefore  each  path  must  also  contain  a 
modulator  capable  of  switching  the  pulse  through  the  particular  waveg¬ 
uide  guide  on  and  off.  These  modulators  can  be  fabricated  by  either  de¬ 
positing  ZnO  on  the  waveguide  and  patterning  the  ZnO  into  a  Mach-Zhender 
interferometer  modulator  configuration  or  attaching  a  LiNbOg  Mach- 
Zhender  modulator  onto  the  waveguide.  Since  both  LiNbOg  and  ZnO  have 

substantially  higher  indices  of  refraction  than  glass  the  light  will  switch 
from  the  glass  into  the  ZnO  or  LiNbOg. 

We  built  and  tested  branching  planar  and  ridge  waveguides  on  glass 
substrates. 
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c)  Waveguide  Fabrication 


The  waveguide  is  fabricated  by  doping  channels  in  the  surface  of  opti¬ 
cal  quality  glass  with  silver.  Silver  increases  the  index  of  refraction  of 
the  glass.  Thus,  light  will  be  confined  to  the  doped  channels  in  the  glass. 
Since  the  increase  of  index  of  refraction  of  the  glass  is  small,  only  about 
0.1  to  0.05%  the  light  will  be  weakly  guided.  This  restricts  the  curvature 
of  the  guides  to  large  radii.  On  the  other  hand,  since  there  are  large  re¬ 
gions  on  the  sides  of  the  waveguides  where  the  light  intensity  decreases 
exponentially,  it  is  easy  to  construct  evanescently  coupled  waveguides. 

The  doping  was  accomplished  as  follows:  First,  a  chromium  film  was 
deposited  on  the  optical  quality  glass  substrate.  A  layer  of  photoresist 
was  deposited  on  the  chromium  film.  The  photoresist  was  exposed  through 
a  mask  and  developed.  This  leaves  photoresist  only  where  where  the 
waveguide  will  not  be.  The  chromium  film  is  next  etched  into  the  waveg¬ 
uide  pattern.  Thus,  this  chromium  film  served  as  a  mask.  Next,  a  silver 
metal  film  was  vacuum  deposited  on  the  chromium  mask  and  the  bare 
glass  waveguide  pattern.  The  silver  film  was  covered  with  a  second  vac¬ 
uum  deposited  chromium  film.  Another  chromium  film  was  deposited  on 
the  bottom  of  the  glass  substrate.  Electrodes  were  attached  to  the  top  and 
bottom  chromium  films.  A  voltage  of  about  10  Volts  was  applied  between 
the  chromium  film  electrodes  while  the  substrate  was  heated  to  350  °C 
for  2  hours.  During  this  process  the  silver  diffuses  into  the  glass  forming 
a  channel  with  a  higher  index  of  refraction  than  the  surrounding  glass. 
Silver  diffuses  much  more  readily  than  chromium  in  glass.  This  process  is 
known  as  electric  field  assisted  diffusion.  After  the  diffusion  process  is 
completed  the  chromium  films  were  etched  off.  This  process  results  in  a 
planar  waveguide. 

Ridge  waveguides  are  fabricated  by  first  fabricating  a  planar  waveg¬ 
uide.  Photoresist  is,  again,  deposited  on  the  glass  substrate.  The  photore¬ 
sist  is  exposed  through  the  same  mask  and  developed.  The  mask  is  aligned 

with  the  waveguide  using  alignment  marks.  This  leaves  photoresist  on  the 
waveguide.  The  glass  substrate  is  etched  using  HCI  in  HgO  in  a  1  to  4  ratio 
for  one  hour.  This  is  followed  by  etching  in  a  1  to  15  HCI  solution  in  HgO 
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for  5  minutes  to  smooth  the  waveguide  edges.  Lastly  the  remaining  pho¬ 
toresist  is  removed.  This  process  leaves  an  elevated  ridge  waveguide  on 
the  glass  substrate.  Ridge  waveguides  can  guide  light  around  much  smaller 
radii  of  curvature  than  planar  guide  can. 

d)  Waveguide  measurements 

We  have  measured  the  performance  of  the  waveguides.  Waveguides  in 
the  shape  of  a  “Y”  were  fabricated.  Both  ends  of  the  substrates  were  pol¬ 
ished  to  provide  flat  smooth  input  and  output  surfaces.  A  single  mode 
fiber  was  attached  to  the  single  leg  of  the  “Y”.  The  fiber  was  used  as  the 
light  input  from  a  HeNe  laser.  The  other  end  of  the  substrate  was  observed 
with  a  CCD  camera  connected  to  a  computer.  The  computer  generated 
three-dimensional  plots  of  the  light  intensity  as  a  function  of  position. 
The  pattern  produced  by  a  ridge  waveguide  is  shown  in  Fig.  3  and  the  out¬ 
put  from  a  planar  waveguide  is  shown  in  Fig.  4.  We  note  that  in  both  cases 
distinct,  approximately,  Gaussian  beam  profiles  are  observed. 
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Fig.  3.  Output  light  intensity  profile  of  "Y"  shaped  planar  waveguide. 
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Fig.  4.  Output  light  intensity  profile  of  "Y"  shaped  ridge  waveguide. 
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2  TUNNELING  AND  GATE 


We  have  fabricated  and  tested  several  photon  assisted  tunneling  diodes. 
The  devices  consisted  of  a  0.5  mm  wide  aluminum  strip  vacuum  deposited 
on  a  glass  substrate.  The  aluminum  strip  was  deposited  through  a  mask. 
The  aluminum  film  was  oxidized  in  ambient  air.  SiO  insulating  edge  pro¬ 
tection  layers  were  deposited  through  a  mask  onto  the  edges  of  the  alu¬ 
minum  strip.  Thin,  80  X  thick,  strips  of  aluminum  were  vacuum  deposited 
across  the  oxidized  aluminum  strip  with  edge  protectors.  These  top  alu¬ 
minum  contacts  are  thin  enough  to  be  transparent  to  light.  Copper  contacts 
were  deposited  on  the  thin  aluminum  strips  on  either  side  of  the  oxidized 
aluminum  strip,  see  Fig.  5.  The  edge  protection  prevents  shorting 


Fig.  5.  Tunneling  diodes. 
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at  the  edges  of  the  bottom  aluminum  strip. 

We  measured  the  tunneling  current.  All  tunneling  currents  measured 
were  quite  low.  The  oxide  layers  on  ail  devices  fabricated  were  much  to 
thick.  However,  we  were  able  to  observe  an  increase  in  the  tunneling  cur¬ 
rent  with  illumination  from  a  HeNe  laser.  The  signal  obtained  were  rather 
small.  The  reason  for  this  is  that  98%  of  the  light  is  reflected  and  only  2% 
is  absorbed  by  the  device  and  that  the  oxide  layer  was  very  thick.  The  light 
absorption  can  greatly  be  increased  by  using  an  anti-reflection  coating. 
Since  the  device  is  made  out  of  a  metal  a  standard  anti-reflection  coating 
could  not  be  used.  However,  an  etalon  consisting  of  a  thin  transparent  film 
polystirene  layer  coated  with  a  50  A  thick  aluminum  layer  was  tried.  Light 
absorption  is  increased  by  a  factor  of  about  ten  by  the  use  of  the  etalon. 
However,  the  thick  oxide  layer  still  limited  the  tunnelling  current. 

Since  the  late  1950s  various  attempts  have  been  made  to  fabricate 
Tunneling  Transistors.  This  device  consists  of  a  thick  metal  emitter,  a 
30  to  50  A  thick  insulating  oxide  layer,  a  thin  (20  to  100  A  thick)  metal 
base,  another  30  to  50  A  thick  insulating  oxide  layer,  and  a  metal  collec¬ 
tor  electrode.  This  is  a  Metal,  Oxide,  Metal,  Oxide,  Metal  (MOMOM)  device. 
Such  a  device  could  be  very  fast  since  the  quantum  mechanical  tunneling 
process  is  very  fast.  However,  to  date  no  successful  MOMOM  transistor  has 
been  build.  The  reason  these  device  did  not  work  is  that  if  the  metal  film 
that  form  the  base  is  thick  enough  to  conduct  horizontally  it  will  be  to 
thick  for  electrons  to  penetrate  the  film  from  insulator  to  insulator.  The 
electrons  are  scattered  in  the  metal.  It  is  possible  to  make  very  thin 
metal  base  layers.  However,  such  layers  don't  conduct  horizontally.  We 
propose  to  build  MOMOM  devices  with  exceedingly  thin  metal  base  layers 
only  10  to  20  A  thick.  We  plan  to  access  these  layers  optically  through  a 
thin  (100  A  thick)  collector  electrode  rather  than  trying  to  make  electri¬ 
cal  contact  to  them.  We  are  proposing  to  try  to  build  a  Photon  Assisted 
Tunneling  Transistor  (PATT).  Two  of  these  devices  can  be  connected  in 
series  to  form  a  Photon  Assisted  Tunneling  AND  Gate. 
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DEFECT  PROPERTIES  OF  PHOTOREFRACTIVE  AND  NON- 
PHOTOREFRACTIVE  BSO 

Wallace  B.  Leigh,  PhD 
Assistant  Professor 
Division  of  Electrical  Engineering 
Alfred  University 

Abstract 

Defect  properties  of  various  types  of  single-crystal  Bi^2^'^20 
(BSO)  were  investigated  to  determine  photorefracting  and  defect 
properties  of  these  crystals.  Crystals  investigated  included 
photorefracting  and  non-photorefracting  BSO  grown  by  Czochralski  (Cz) 
technique,  BSO  grown  by  Hydrothermal  (HTL)  technique,  and  Cz  BSO 
which  had  been  irradiated  by  gamma  radiation.  Above  room-temperature 
optical  absorption  measurements  indicate  an  anomalous  high  shift  of  the 
absorption  edge  with  temperature  for  the  HTL  material,  a  shift  attributed 
to  changes  in  the  localized  environment  of  the  oxygen-residing  lone  pair 
electrons.  Fit  of  absorption  data  to  the  Urbach  rule  indicates  the 
presence  of  different  impurity  bands  in  the  photorefracting  and  non- 
photorefracting  Cz  material;  impurity  bands  which  are  not  present  in  the 
HTL  material.  Furthermore  it  is  demonstrated  that  fit  of  the  absorption 
band  tails  to  the  Urbach  rule  may  be  used  as  a  method  of  characterizing 
impurity  content  in  BSO.  The  irradiated  Cz  material  had  been 
investigated  in  an  attempt  to  improve  the  photorefracting  efficiencies 
and  response  of  BSO  using  red  and  infra-red  pump  and  probe  beams 
rather  than  blue  and  red.  Clear  samples  of  Cz  and  HTL  BSO  were 
observed  to  be  photorefracting  using  632  nm  pump  beams,  with 
efficiencies  of  only  ~10‘®  and  response  times  ranging  from  seconds  for 
HTL  material  to  several  minutes  for  Cz  samples.  Gratings  formed  in  this 
manner  were  observed  to  fix  in  the  Cz  samples,  causing  the  long  erase 
times.  Fixing  was  not  observed  in  the  HTL  material.  These 
photorefracting  properties  were  not  observed  to  change  in  irradiated 
BSO  and  it  is  shown  that  the  small  percentage  change  in  absorption  due 
to  irradiation  cannot  be  expected  to  improve  photorefracting  properties 
in  the  long-wavelength  regions. 
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DEFECT  PROPERTIES  OF  PHOTOREFRACTI VE  AND  NON- 
PHOTOREFRACTIVE  BSO 

Wallace  B.  Leigh 

Introduction 

Photorefraction  is  a  method  of  writing  volume  holograms  which  are 
dynamic  in  nature;  that  is,  the  volume  gratings  eventually  decay 
following  cessation  of  exposure.  Photorefraction  is  observed  in  cubic 
insulators  such  as  Bii2Si02o  (BSO)  and  Bi-|2GeO20  (BGO)  ,  in 
ferroelectrics  such  as  BaTiOg;  and  in  cubic  semi-insulators  such  as 
lnP:Fe,  and  GaAs:Cr.  The  applications  of  photorefraction  in  these 
materials  include  many  promising  new  advances  in  image  processing 
and  optical  information  processing.  Some  of  these  applications  are 
already  bearing  fruit  with  new  products. 

Many  of  the  photorefractive  crystals  mentioned  are  being  grown 
at  Rome  Laboratories,  Hanscom  AFB  and  their  photorefractive  properties 
are  being  investigated  there  as  well.  Introductory  studies  of  defects  in 
BSO  and  their  relation  to  photorefraction  were  begun  by  this  author  at 
Rome  under  an  Air  Force  Summer  Faculty  Research  program.  This 
report  gives  continuing  progress  on  research  initiated  in  that  study,  on 
properties  of  different  types  of  photorefracting  BSO.  The  purpose  of  this 
study  was  twofold:  The  first  is  a  continuation  of  the  original  study  set  out 
to  investigate  the  properties  of  BSO  grown  by  the  Cz  and  HTL  methods 
at  Rome.  The  second,  in  an  attempt  to  lower  the  pump  and  probe  beam 
energies  to  red  and  infrared  regions,  the  photorefractive  properties  of 
Cz,  HTL,  and  irradiated  Cz  BSO  were  investigated  at  these  longer 
wavelengths. 
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Ct  and  HTL  BSO 

Several  samples  of  Cz  and  HTL  BSO  were  investigated  to  help 
identify  why  HTL  BSO  is  transparent  in  the  visible  and,  as  undoped,  has 
different  photorefracting  properties  compared  to  Cz  BSO.  Used  in  this 
study  were  two  samples  of  photorefracting  Cz  BSO  grown  from  4-9s 
purity  sources.  These  samples  were  observed  to  be  photorefracting 
using  455nm  radiation  as  a  pumping  source  and  632  nm  radiation  as  a 
probe,  with  efficiencies  of  10”^  and  response  times  of  several 
milliseconds.  In  this  report,  we  will  define  "short  wavelength 
photorefraction"  as  photorefraction  experiments  conducted  using  biue 
pump  and  red  probe  beams.  "Long  wavelength  photorefraction"  will  be 
defined  as  that  measured  under  red  pump  beams  and  infrared  probe 
beams,  and  results  of  these  investigations  will  be  addressed  later  in  the 
report.  Undoped  HTL  material  is  not  photorefracting  under  short 
wavelength  conditions.  Also  under  study  were  two  samples  of  low  purity 
Cz  BSO,  grown  with  only  3-9s  purity  sources.  This  material  has  not 
been  observed  as  photorefracting  under  any  conditions,  however  neither 
of  the  3-9s  purity  samples  were  sufficiently  clear  for  proper 
photorefractive  measurements.  Finally,  two  samples  of  HTL  BSO  were 
used  ,  one  of  which  is  photorefracting  at  long  wavelengths,  the  other, 
not  clear  for  photorefracting  measurements. 

An  earlier  thermally  stimulated  current  (TSC)  study[1]  of  these 
samples  was  conducted  and  results  are  shown  in  Fig.  1.  This  data 
shows  the  concentration  of  traps  of  activation  energy  <  0.7  eV  in  HTL 
material  is  approximately  a  factor  of  10  smaller  than  that  of  Cz  BSO. 
At  least  five  different  defects  were  identified  in  the  TSC  measurements. 
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While  a  large  concentration  of  shallow  traps  were  always  found  in  the  Cz 
material,  the  identity  of  the  dominant  shallow  trap  is  not  always  the 
same.  In  that  same  study,  high  temperature  electrical  conductivity  data 
indicated  a  thermally  activated  carrier  excitation  of  energy  -1.3  eV.  The 
Arrhenius  plot  of  this  data  is  shown  in  Fig  (2).  This  thermally  activated 
process  was  observed  for  both  Cz  and  HTL  material,  although  trap 
identity  and  concentration  measurements  were  not  available. 

Temperature  Dependent  Absorption. 

In  an  attempt  to  identify  absorption/conduction  mechanisms  in  the 
different  types  of  BSO,  temperature  dependent  optical  absorption 
studies  were  undertaken.  While  the  absorption  edge  for  the  transparent 
HTL  material  is  believed,  but  not,  as  yet  confirmed,  to  be  due  to  band- 
to-band  transitions,  that  of  the  Cz  material  is  unl^wn.  Absorption 
studies  as  a  function  of  temperature  help  identify  different  absorption 
and  conduction  processes  for  the  different  materiai  types.  Another 
motivation  for  temperature  dependent  absorption  studies  is  the  odd 
behavior  of  HTL  materiai  at  high  temperatures.  It  has  been  noted  that 
when  transparent  HTL  material  is  heated  to  moderate  temperatures,  it 
tends  to  color,  and  when  it  cools  it  becomes  transparent  again.  Cz 
material  does  not  produce  this  phenomenon,  and  instead  remains  the 
same  yellow  color  through  heating. 

To  analyze  this  phenomenon  as  well  as  the  general  conduction 
mechanisms  of  the  three  types  of  samples,  transmission/absorption  data 
was  measured  as  a  function  of  temperature.  A  standard  transmission 
apparatus  is  used  for  this  purpose,  with  the  sampie  mounted  on  a  copper 
block  containing  a  heating  element.  The  block  is  milled  to  fit  the  sample 
in  a  fashion  to  help  minimize  convection  currents.  This  apparatus  can 
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be  used  to  measure  transmission  data  for  temperatures  ranging  from  RT 
to  500°C.  A  quartz-halogen  light  source  is  used,  along  with  a  1/4  meter 
monochrometer  with  a  bandpass  of  0.4  nm.  A  silicon  photodetector  and 
lock-in  detections  system  is  used,  and  the  system  is  oomputer 
controlled. 

For  the  absorption  studies,  three  different  types  of  samples  of  BSO 
were  used:  Three  samples  were  of  high  purity  BSO,  two  samples  of  low 
purity  BSO  and  one.HTL  BSO  sample.  Absorption  data  has  been 
determined  from  the  relation 

(1) 

for  reflection  from  two  surfaces,  and  plotted  on  a  semilog  plot  for  the 
three  sample  types.  In  Eq.  (1),  t  is  the  sample  thickness  and  R  is  the 
reflectivity.  The  reflectivity  is  determined  by  measuring  the  transmission 
in  the  mid-ir  and  assuming  no  absorption  at  this  wavelength. 

Figure  3  shows  the  spectral  absorption  curves  for  different 
temperatures  for  three  samples.  BSO  9  (fig(a))  is  3-9s  pure  Cz  grown, 
BSO110H  (fig(b))  is  4-9s  pure  Cz  grown  and  BS11 0A(fig(c))  is  HTL 
grown  material.  All  values  of  a  are  in  m‘^.  The  change  of  color  in  HTL 
BSO  at  high  temperatures  is  shown  to  be  caused  by  the  shift  in  the 
absorption  edge  with  temperature.  Heating  up  to  -400°C,  the  sample 
turns  from  yellow  to  green  as  the  absorption  edge  moves  into  the  visible 
region  of  the  spectrum.  In  Figures  (a)  and  (b)  it  is  observed  that  the 
shift  with  temperature  is  not  as  great  for  Cz  BSO.  The  shift  for  HTL  BSO 
is  observed  to  be  linear  with  temperature  ,  and  is  measured  to  be 
-2X10'^  eV/°C.  The  absorption  edge  was  determined  from  the  intercept 
of  transmission  curves  with  the  x  axis.  For  Cz  material  the  shift  is  much 
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smaller,  -10“^  for  the  Cz  BSO  samples.  This  can  be  compared  to  ~10 
eV/°C  for  most  materials[p95l].  Furthermore,  for  Cz  BSO  the  shift  was 
not  linear  aithough  no  concise  relationship  could  be  determined  from 
these  samples. 

While  it  is  difficult  to  determine  the  nature  of  the  absorption  shift  for 
HTL  BSO,  it  is  likely  that  the  lone  pair  electrons  of  oxygen  play  a  role  in 
determining  the  band  structure.  It  has  been  found  that  in  amorphous 
semiconductors  containing  group  VI  elements,  anomalous  pressure  and 
temperature  variations  of  the  absorption  edge,  refractive  index  and  other 
optical  properties  have  been  explained  by  a  change  in  the  iocal 
environment  of  the  group  VI  element  which  contains  lone-pair 
electrons[3,4].  In  Cz  material,  the  absorption  is  mechanism  is  different, 
characterized  by  the  impurity  band  of  that  material. 

It  is  noted  that  the  "tails",  which  is  the  area  defined  as  the  band- 
edge  (the  high-energy  regions  of  the  curves  in  Fig.  3)  can  be  modeled  as 
an  exponential  function.  This  is  the  so-calied  "Urbach  rule"  and  the  data 
of  this  region  are  fit  to  the  equation 


a(E,T)  =  e 


-a(E  -E)/kT 


(2) 


Where  a(E,T)  is  the  absorption  coefficient  and  E  is  the  photon  energy. 
The  main  explanation  for  the  dependence  given  by  Eq(2)  is  that  the 
energy  of  the  excited  electron  states  varies  due  to  lattice  deformation  by 
lattice  vibrations.  Any  increase  in  the  deformation  has  an  effect  on  the 
measured  value  of  a.  In  this  way,  any  perturbation,  such  as  disorder  or 
the  presence  of  impurity  tails,  can  be  determined  in  a  somewhat 
qualitative  fashion  from  the  dependence  of  0  and  E^  with  temperature. 

Figure  4  shows  values  of  a  with  temperature  for  the  three  sample 
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types.  Again,  BSO  9  is  3-9s  pure  Cz  grown,  BSO110H  is  4-9s  pure  Cz 
grown  and  BS110A  is  HTL  grown  material.  There  is  a  linear  dependence 
of  o  with  temperature  for  all  three  sample  types,  and  the  slope  of  this 
dependence  falls  with  increasing  impurity  type. 

The  effect  of  impurity  concentration  on  absorption  tails  has  been 
considered  in  Ref[5-7].  For  crystals  containing  impurity  bands,  and  for  a 
certain  concentration  range  of  impurities,  there  is  an  effect  on  the 
variation  of  a  with  temperature.  Depending  on  the  shift  of  the 
absorption  edge  with  impurity  concentration,  the  effects  of  impurities  in 
a  band  is  to  either  decrease  or  increase  the  linear  dependence  of  a  with 
temperature.  Furthermore,  it  was  shown  in  reference  [5]  that  this  shift  is 
linear  with  impurity  concentration.  These  effects  are  due  to  phonon- 
assisted  absorption  processes.  As  the  impurity  band  represents  an 
added  deformation  potential,  the  change  in  sigma  can  be  noted. 

Obviously,  from  Fig.  4  this  condition  holds  for  BSO.  The  HTL  BSO 
has  been  shown  by  the  TSC  measurements  to  be  of  less  impurity  content 
compared  to  Cz  BSO.  The  relation  of  a  with  temperature  is  linear  in  the 
range  of  BSO  being  grown  at  Hanscom.  In  the  less  pure  Cz  material, 
the  slope  of  the  line  is  less,  correlating  with  an  added  impurity 
concentration.  A  more  detailed  study  may  prove  a  correlation  between 
impurity  content  and  the  a  vs.  T  variation.  This  technique  may  prove  a 
rapid  means  for  characterization  of  BSO. 

It  appears  from  Fig.  3  that  there  are  two  absorption  processes,  one 
at  longer  wavelengths  and  one  at  shorter  ones,  separated  by  a  "kink"  in 
the  absorption  curve.  The  reason  for  the  "kink"  in  the  absorption  curve 
is  as  yet  unknown.  It  is  interesting  to  observe  that  in  the  high-purity 
BSO  at  ~400°C,  one  absorption  process  disappears  in  favor  of  the  other 


(6-8) 


and  the  "kink"  disappears. 


Phntnrpfrartinn  at  lonner  wavelengths 

Most  experiments  used  to  characterize  the  photorefractive  properties 
of  BSO  are  performed  with  the  use  of  two  lasers:  a  "pump"  laser  source 
and  a  "probe"  laser  source.  Photorefraction  is  most  efficient  when  a 
pump  beam  of  ~500nm  (from  an  Argon  ion  source)  is  used  with  a  633  nm 
probe  beam  of  a  Helium-Neon  laser.  Nevertheless,  photorefraction 
does  occur  using  pump  and  probe  beams  of  longer  wavelengths[8,9] 
despite  the  small  (-O.Acm'"')  absorption  coefficient  at  these  frequencies. 
In  the  literature  it  was  postulated  that  some  of  the  more  shallow  levels 
were  responsible  for  the  photorefractive  properties  at  these 

wavelengths. 

Two  samples  of  Cz  BSO  were  irradiated  with  SOMrad  of  gamma- 
ray  radiation  in  order  to  analyze  radiation-created  defects  and  to  move 
the  absorption  edge  to  longer  wavelengths.  Fig.  5  shows  the  effect  of 
irradiation  on  the  absorption  edge  for  Cz  BSO.  By  moving  the 
absorption  edge  closer  to  633nm,  a  He-Ne  pump  beam  used  to  write 
photorefractive  gratings  would  have  higher  efficiencies  than  non- 

irradiated  BSO. 

The  photorefractive  setup  was  a  typical  non-degenerate  4  beam 
shown  in  Fig.  6.  A  10mW  633nm  polarized  HeNe  laser  was  used  as  a 
pump,  and  a  5mW  806nm  laser  diode  was  used  as  a  probe.  A  silicon 
detector  and  lock-in  detection  system  was  used  to  detect  the  diffracted 
probe  beam.  The  set-up  allows  an  incident  pump  angle  of  9.7°  which 
corresponds  to  the  formation  of  holographic  gratings  of  ~2  urn  spacing. 
The  incident  pump  irradiance  was  500  mW/cm^.  Chopping  frequency 
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was  250  Hz. 

Initially  observed  phenomena  with  the  irradiated  material  included 
the  tendency  for  the  damaged  area  to  cure.  Indeed,  after  ~3  weeks 
time  the  irradiated  material  had  returned  to  its  original  unirradiated 
condition.  Furthermore,  curing  seemed  to  be  accelerated  in  the 
presence  of  the  633nm  light.  No  additional  defects  were  observed  in 
TSC  measurements  of  the  irradiated  material,  and  the  only  major 
difference  observed  between  irradiated  and  unirradiated  BSO  is  in  the 
transmission  curves  of  Fig.  5. 

All  BSO  material  tested  was  observed  to  be  photorefractive 
using  the  set-up  in  Fig.  6,  with  the  exception  of  the  low-purity  Cz 
material.  This  includes  the  irradiated  and  non-irradiated  Cz  material  as 
well  as  HTL  material.  The  major  limitation  in  long-wavelength 
photorefraction  is  in  obtaining  a  material  which  is  totally  free  of 
occlusions,  cracks  or  other  physical  defects  from  which  the  infrared  light 
may  scatter.  The  efficiencies  of  photorefraction  at  these  wavelengths 
tend  to  be  quite  low,  on  the  order  of  10‘®,  which  is  on  the  order  of  the 
limits  of -fl**  detection  for  the  system  of  Fig.  6. 

As  the  laser  diode  has  a  higher  irradiance  than  the  pump  beam, 
there  is  a  tendency  for  the  probe  to  erase  the  grating  once  it  is  formed. 
The  order  of  the  experiment  then,  is  as  follows:  First  the  pump  beams 
are  allowed  to  form  the  grating  with  the  probe  beam  off.  The  probe 
beam  is  next  turned  on  and  efficiencies  are  then  recorded.  After 
equilibration  is  maintained,  one  of  the  pump  beams  is  turned  off  and  the 
grating  is  allowed  to  decay.  This  is  then  recorded  by  measuring  the 
decay  in  the  diffracted  probe  beam.  Care  was  taken  to  eliminate 
reflections  and  scattering  of  the  IR  light.  Several  measurements  were 
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taken  at  different  spots  on  the  sample  at  different  times. 

Figure  7  shows  the  recorded  diffracted  beam  from  a  Cz  sample 
where  red  beams  are  used  as  the  pumping  source.  Initial  rise  is  quite 
swift,  followed  by  a  slow  decline  as  the  probe  erases  the  grating.  After 
several  minutes,  one  of  the  red  beams  is  blocked  and  the  signal  from  the 
diffracted  probe  beam  is  allowed  to  decay.  In  Cz  material,  this  decay 
can  vary  anywhere  from  several  minutes  to  several  tens  of  minutes, 
varying  from  sample  to  sample  and  from  different  probed  areas  of  the 
sample.  This  long  delay  time  can  be  explained  as  a  "fixing"  of  the 
holograms  by  compensated  charges.  These  charges  make  up  the  so- 
called  "complementary"  grating  which  exists  long  after  the  primary 
grating  has  dispersed.  Fixing  can  occur  from  charge  compensation  by 
free  carriers[9,10]  or  by  drift  of  ions[11]  under  the  influence  of  the 
space  charge  electric  field  set-up  by  the  primary  grating.  In  short- 
wavelength  photorefraction  efficiencies  of  the  complimentary  gratings 
can  be  much  lower  than  that  of  the  primary,  but  since  primary 
efficiencies  are  low  under  long  wavelength  conditions,  it  is  more 
apparent  in  our  system. 

An  interesting  development  of  photorefraction  is  that  a  transparent 
HTL  sample,  which  is  not  photorefracting  at  short  wavelengths.  Is 
potorefracting  at  longer  wavelengths.  Furthermore,  the  fixing  of  the 
primary  by  a  complimentary  grating  is  not  observed  in  HTL  material. 
Thus  erasure  of  the  the  grating  in  the  HTL  sample  occurs  at  a  much 
faster  response  time  than  that  of  the  Cz  material.  Figure  8  shows  the 
initial  rise  of  the  diffracted  beam  followed  by  the  erasure  of  the  beam 
from  HTL  BSO  when  one  red  pump  beam  is  blocked.  The  relative 
efficiency  of  photorefraction  using  this  system  is  about  the  same  for  Cz 
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and  HTL  material. 

Although  it  appears  in  the  figure  that  not  all  the  grating  has  erased, 
several  other  measurements  on  this  sample  could  not  confirm  that  a 
complimentary  grating  ever  existed,  and  usually  the  signal  returns  to 
zero  once  the  primary  grating  is  erased.  This  represents  a  change  of 
several  orders  of  magnitude  in  the  time  to  erase  the  grating  as  compared 
to  Cz  material. 

Since  the  defect  population  as  determined  by  TSC  is  very  low  for 
levels  of  thermal  activation  energy  less  than  0.7  eV,  it  is  apparent  that  a 
trap  level  deeper  than  this  must  be  involved  in  the  HTL  photorefraction 
illustrated  in  Fig.  8.  Figure  2  shows  high  temperature  electrical 
conductivity  data  which  indicates  a  thermal  activated  carrier  excitation 
of  energy  -1.3  eV.  This  thermally  activated  process  is  observed  for  both 
Cz  and  HTL  material,  and  could  very  well  be  the  responsible  absorption 

process  for  photorefraction  at  long  wavelengths. 

As  for  the  absence  of  thermal  fixing  in  HTL  material,  either  the 
mechanism  of  photorefraction  is  different  in  HTL  samples  or  the  ions 
responsible  for  the  fixing  of  gratings  in  the  Cz  material  are  of  not  of  the 
same  concentration  or  diffusivity  in  HTL  material. 
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Conclusions 

Spectral  dependence  of  absorption  at  different  temperatures  has 
proven  that  hydrothermal-grown  BSO  has  a  large  shift  of  the  absorption 
edge  with  temperature.  This  shift  is  quite  possibly  characterizing  the 
change  in  bandgap  with  temperature.  The  magnitude  of  the  shift  is 
caused  by  changes  in  the  localized  environment  of  the  lone  pair 
electrons  residing  on  oxygen.  Fit  of  the  absorption  data  to  Urbach’s  rule 
has  shown  a  way  to  characterize  impurity  content  in  Cz  and  HTL  BSO. 
Photorefraction  at  long  wavelengths  in  both  Cz  and  HTL  BSO  has  been 
demonstrated.  Long-wavelength  photorefraction  may  be  characterized 
by  a  fixing  of  the  holographic  gratings  in  Cz  material,  which  is  not 
observed  in  HTL  material. 
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l  ist  of  Figures 


1  TSC  measurements  of  a)  Cz  BSO  sample  BSO  45,  and  b) 

Hydrothermal  sample  BS  40,  and  c)  Cz  sample  BSO  55,  grown 
from  a  hydrothermal  charge. 

2  Arrhenius  plot  of  the  initial  rise  of  high  temperature  conductivity  for 

different  BSO  samples.  Cross-sectional  areas  were  the  same  for 
all  samples  at  0.25  cm^. 

3  Log  absorption  vs'incident  energy  for  (a)  low-purity  Cz  sample  BS09 

(b)  high  purity  Cz  sample  BSO110H  and  (c)  HTL  sample  BS110A. 
Alpha  is  measured  in  m*^ 

4.  Plots  of  slopes  of  absorption  tails  vs  temperature  for  the  three  BSO 
sampies  of  Fig.  3. 

5  Transmission  spectrum  of  Cz  sample  BSO  51  before  (a)  and  after  (b) 

50Mrad  of  gamma-ray  irradiation. 

6  Basic  four  wave  mixing  apparatus  used  for  long-wavelength 

photorefraction  measurements. 

7  Long  wavelength  photorefraction  efficiency  measurement  of  a 

diffracted  IR  probe  beam  from  holographic  gratings  formed  in  Cz 
sample  BSO110H  using  633nm  pump  beams.  Initialiy,  the  grating 
is  aliowed  to  form  with  the  IR  beam  blocked.  Then  the  IR  probe  is 
unblocked  and  the  efficiency  recorded. 

8  Long  wavelength  photorefraction  efficiency  measurement  of  a 

diffracted  IR  probe  beam  from  holographic  gratings  formed  in  HTL 
sampie  BS110A  using  633nm  pump  beams.  Initially,  the  grating  is 
allowed  to  form  with  the  IR  beam  blocked.  Then  the  IR  probe  is 
unblocked  and  the  efficiency  recorded.  Also  shown  is  erasure  of 
the  grating  when  one  of  the  pump  beams  is  blocked. 
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Abstract 

The  underlying  goal  of  the  proposed  research  is  to  contribute  substantively  to  the  solution  of  a 
major  problem  facing  the  development  of  realistic  Natural  Language  Processing  (NLP)  systems, 
namely  the  construction  of  reasonably-sized  lexicons.  The  knowledge  for  use  in  NLP  systems 
must  be  acquired  and  represented  in  such  a  manner  that  its  implicit  semantics  are  then 
available  for  the  necessary  inferencing  of  the  host  system.  The  methodology  exploited  in  the 
research  herein  reported,  provides  one  plausible  alternative.  It  is  called  knowledgB  extraction 
(Liddy,  Kwasnik  &  Myaeng,  1989)  and  is  a  means  whereby  pre-existing  textual  sources  are 
processed  in  as  automatic  a  manner  as  possible  with  the  goal  of  extracting  the  knowledge  encoded 
in  natural  language  repositories.  This  approach  is  based  on  the  existence  of  detectable  linguistic 
patterns  which  predictably  reveal  the  presence  of  semantic  relations  and  derives  from 
sublanguage  theory  (Sager  et  al,  1987)  which  has  shown  that  within  a  particular  text-type, 
predictable  syntactic  and  semantic  regularities  develop  over  time,  and  these  regularities  are 
used  consistently.  The  meaning  to  be  extracted  consists  of  concepts  and  the  relations  among  these 
concepts.  Machine  Readable  Dictionaries  (MRDs)  form  an  excellent  and  readily  available  textual 
source  for  performing  semantic  knowledge  extraction. 

Our  efforts  have  been  focused  on  developing  patterns  that  can  be  used  in  algorithms  for  the 
automatic  recognition  of  semantic  relations  in  definitions  for  inclusion  in  a  semantic 
representation  for  use  by  NLP  systems.  The  specific  focus  of  this  research  effort  was  to  develop 
means  for  automatically  determining  in  a  verb  definition  which  of  a  verb's  senses  was  intended 
by  the  lexicographer  who  created  the  entry.  For  example,  the  most  frequently  used  verbs  in  a 
dictionary  are  used  in  thousands  of  definitions.  However,  each  verb  has  several  senses  and  it  is 
important  when  creating  a  semantic  representation  of  a  definition  to  know  which  sense  of  that 
verb  is  being  used  in  defining  the  verb  being  represented.  For  example,  the  word  'make'  is  used 
in  approximately  eight  hundred  definitions  in  our  machine-readable  dictionary  (LDOCE). 
However,  'make'  has  several  senses  (PRODUCE,  PERFORM,  or  CAUSE)  and  it  is  important  when 
creating  a  semantic  representation  of  a  definition  to  know  which  of  the  senses  of  'make'  is  being 
used  in  defining  the  verb  being  represented.  We  refer  to  the  patterns  we  developed  to  recognize 
specific  senses  as  Relation  Revealing  Formulae  (RRFs).  RRFs  couple  a  semantic  relation  to  the 
particular  linguistic  features  of  definitions  which  reliably  indicate  the  presence  of  that 
semantic  relation.  We  analyzed  the  seventeen  most  frequently  used  verbs  in  LDOCE  and  created 
recognition  templates  which  contain  RRF  which  we  developed.  These  RRF  will  be  applied  to 
unanalyzed  definitions  to  determine  in  each  definition,  which  sense  of  the  defining  verb  is 
intended.  The  RRF  are  based  on  1,912  verb  definitions  which  we  analyzed. 
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AUTOMATIC  CONSTRUCTION  of  a  SEMANTIC  LEXICON  for  use  in 
NATURAL  LANGUAGE  PROCESSING  SYSTEMS 

Elizabeth  D.  Liddy 
Diana  M.  Lauterbach 


Problem 

The  underlying  goal  of  the  proposed  research  is  to  contribute  substantively  to  the  solution  of  a 
major  problem  facing  the  development  of  Natural  Language  Processing  (NLP)  systems,  namely 
the  construction  of  reasonably-sized  lexicons.  As  stated  by  Velardi  et  al  (1991,  p.  157), 
"automatic  or  semi-automatic  acquisition  of  the  lexicon  is  a  critical  factor  in  determining  how 
widespread  the  use  of  natural  language  processors  will  be  in  the  next  few  years".  The  lack  of 
reasonably  sized,  semantically  rich  lexicons  is  posing  one  of  the  major  bottlenecks  in  NLP 
development  (Lenat  et  al,  1986)  and  is  due  to  the  absence  of  a  method  for  the  efficient 
acquisition  of  semantic  knowledge  of  the  world. 

The  knowledge  for  use  in  NLP  systems  must  be  acquired  and  represented  in  such  a  manner  that 
its  implicit  semantics  are  then  available  for  the  necessary  inferencing  of  the  host  system. 
Knowledge  acquisition,  carried  out  by  knowledge  engineers,  is  the  most  commonly  used  approach 
to  acquiring  this  knowledge,  but  it  has  proven  time-consuming  and  difficult  to  contain.  The 
methodology  exploited  in  the  research  herein  reported,  provides  one  plausible  alternative  to 
this  inefficient  process.  It  is  called  knowledge  extraction  (Kwasnik,  Liddy  &  Myaeng,1989), 
and  is  a  means  whereby  pre-existing  textual  sources  are  processed  in  as  automatic  a  manner  as 
possible  with  the  goal  of  extracting  the  knowledge  encoded  in  natural  language  repositories. 

This  approach  is  based  on  the  existence  of  detectable  linguistic  patterns  which  predictably 
reveal  the  presence  of  semantic  relations  and  derives  from  sublanguage  theory  (Sager  et  al, 
1987)  which  has  shown  that  within  a  particular  text-type,  predictable  syntactic  and  semantic 
regularities  develop  over  time,  and  these  regularities  are  used  consistently.  Sublanguages 
develop  amongst  a  group  of  language  users  who  share  a  common  purpose  and  who  must 
repeatedly  produce  instances  of  a  particular  type  of  text  (Kittredge  &  Lehrberger,  1982; 
Grishman  &  Kittredge,  1986).  Since  lexicographers  are  a  specialized  group  working  on  a 
common  task,  the  linguistic  features  of  dictionary  entries  can  be  analyzed  using  sublanguage 
methodologies  which  have  shown  themselves  successful  in  other  text-types  consisting  of  brief. 
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fairly  standardized  utterances  (Liddy  et  al,  1989;  Liddy  et  al,  1991). 


The  meaning  to  be  extracted  consists  of  concepts  and  the  relstions  among  these  concepts.  In  the 
English  language,  concepts  are  most  frequently  expressed  in  the  form  of  noun  phrases  while 
relations  among  concepts  are  commonly  expressed  by  either  verb  phrases  or  prepositions.  The 
concepts  present  in  dictionary  definitions  are  easier  to  recognize  automatically  than  the 
semantic  relations  that  exist  among  concepts,  because  the  language  used  in  dictionary  definitions 
only  implicitly  reveals  these  relations.  For  example,  a  definition  may  not  expressly  state  "the 
purpose  of  X  is  Y",  but  our  results  show  that  there  is  a  delineable  set  of  phrases  such  as  "X  is  an 
instrument  for  Y-ing"  in  which  the  phrase  "is  an  instrument  for"  reveals  the  PURPOSE  relation 
between  X  and  Y.  Relations  define  the  nature  of  the  interaction,  dependency,  influence  or  simply 
co-occurrence  which  holds  between  the  entities. 

Machine  Readable  Dictionaries  (MRDs)  form  an  excellent  and  readily  available  textual  source 
for  performing  semantic  knowledge  extraction  because:  1)  MRDs  are  culturally  validated 
sources  of  much  of  our  commonplace  knowledge  of  the  world  as  accepted  by  native  language 
speakers  over  many  years;  2)  MRDs  provide  raw  data  which  is  unbiased  by  a  particular 
individual's  beliefs,  views,  knowledge  or  experience,  and;  3)  MRDs  span  a  wide  range  of  subject 
areas,  thereby  providing  a  shallower,  but  more  complete  coverage  of  the  language  than  would  be 
available  in  other  corpora. 

MRDs  provide  a  rich  potential  for  knowledge  extraction,  although  it  has  proven  to  be  a  very 
difficult  task.  Even  more  pessimistically  stated  by  Alshawi  (1989),  "...providing  complete 
analyses  of  arbitrary  dictionary  definitions  is  beyond  the  current  state  of  the  art  in 
computational  linguistics."  We  have  not  found  the  task  impossible,  but  have  found  it  very  slow- 
going.  And  although  the  results  are  quite  intellectually  pleasing,  the  automation  of  the  task  is  not 
to  the  point  we  had  originally  expected.  The  size  of  the  task  can  perhaps  be  best  understood  when 
one  realizes  that  full,  well-received  research  papers  have  been  devoted  to  simply  understanding 
dictionary  phrases  of  the  form  "to  VERB  with  NP"  (Ravin,  1993). 

For  our  research,  we  used  the  1987  version  of  Longman's  Dictionary  of  Contemporarv  English 
(LDOCE).  The  1977  version  of  this  MRD  has  been  the  object  of  a  substantive  body  of  work 
(Alshawi,  1987;  Boguraev  &  Briscoe,  1987;  Wilks  et  al,  1987)  which  has  revealed  the 
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potential  richness  of  the  data  encoded  on  it.  LDOCE  is  particularly  appropriate  for  the  present 
work  because  it  was  developed  to  be  used  as  a  learner's  dictionary  by  non-native  English 
speakers  and,  to  accommodate  this  use,  is  based  on  a  defining  vocabulary  of  approximately  2000 
words.  Most  of  the  definitions  are  written  using  only  these  terms  and  any  word  used  in  a 
definition  that  is  not  on  this  list  is  indicated  by  a  type-font  code.  This  controlled  defining 
vocabulary  contributed  substantively  to  the  tractability  of  our  research. 

Figure  1  provides  some  examples  of  how  semantic  relations  are  implicitly  expressed  in  LDOCE 
definitions.  The  terms  following  the  parentheses  exist  in  relation  to  the  term  being  defined,  in 
the  relation  stated  in  capital  letters  within  the  parentheses.  The  name  of  the  relation  within  the 
parentheses  has  been  added  following  intellectual  analysis  of  the  definition.  The  phrases  in  each 
definition  presented  in  bold  print  are  the  lexical  clues  used  in  the  definition  which  suggest  the 
nature  of  that  particular  relationship.  For  example,  in  the  definition  of  diet,  the  GOAL  relation 
exists  between  diet  and  'become  thinner'  and  is  indicated  by  the  phrase  in  order  to. 


depress  -  to  make  [CAUSE]  less  active  or  strong 

smooth  -  having  [CHARACTERISTIC]  an  even  surface 

pitcher  -  a  player  who  [AGENT]  throws  the  ball  toward  the  batter 

stab  -  to  strike  forcefully  with  [INSTRUMENT]  the  point  of  something  sharp 

diet  -  to  eat  according  to  a  special  diet,  esp.  In  order  to  [GOAL]  become  thinner 

shoot  -  an  area  of  land  where  [LOCATION]  animals  are  shot  for  sport 

nail  fiie  -  a  small  instrument  with  a  rough  surface  for  [PURPOSE]  shaping  fingernails 

Fig.  1 :  Sample  Definitions  from  LDOCE 


Our  efforts  have  been  focused  on  developing  patterns  that  can  be  used  in  algorithms  for  the 
automatic  recognition  of  semantic  relations  in  definitions  for  inclusion  in  a  semantic 
representation  for  use  by  NLP  systems.  The  specific  focus  of  this  research  effort  was  to  develop 
means  for  automatically  determining  in  a  verb  definition  which  of  a  verb's  senses  was  intended 
by  the  lexicographer  who  created  the  entry.  For  example,  the  most  frequently  used  verbs  in  a 
dictionary  are  used  in  thousands  of  definitions.  However,  each  verb  has  several  senses  and  it  is 
important  when  creating  a  semantic  representation  of  a  definition  to  know  which  sense  of  that 
verb  is  being  used  in  defining  the  verb  being  represented. 
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We  refer  to  the  patterns  we  developed  as  Relation  Revealing  Formulae  (RRFs).  RRFs  couple  a 
semantic  relation  to  the  particular  linguistic  features  of  definitions  which  reliably  indicate  the 
presence  of  that  semantic  relation.  The  three  levels  of  RRF  which  we  worked  with  are  presented 
in  Figure  2  and  each  is  followed  by  a  sample  RRF  template  of  that  level. 


LEXICAL  RRF:  Require  straightforward  lexical  string  matches  wherein  the  exact  word  or 
phrase  which  has  been  established  as  a  lexical  clue  to  the  relation  is  matched 
in  the  definition.  More  complex  LEXICAL  RRF  require  the  recognition  of  non- 
adjacent  phrase  patterns. 

RRF:  LOCATION  =  ’where’  +  <  > 

SYNTACTIC  RRF:  Consist  of  either  sequences  of  words  of  specified  parts-of-speech  or 

combinations  of  lexical  string  matches  and  specified  parts-of-speech  and 
takes  advantage  of  the  grammar  codes  available  on  the  LDOCE  tape. 

RRF:  RESULTANT  =  ’to  make’  +  <adjective> 

SEMANTIC  RRF:  Depend  on  information  about  semantic  features  of  words  (e.g.  animacy) 
in  the  formulae  being  available  in  the  dictionary  entry  box  codes.  It  is 
typical  for  a  single  Semantic  RRF  to  include  lexical  and  syntactic 
matching  as  well  as  semantic  matching, 

RRF:  EXPERIENCER  =  ’a’/’an’  +  <animate  noun>  +  ’who’  +  <stative  verb> 

Fig.  2:  Three  Levels  of  Relation-Revealing  Formulae 


Methodology 

The  determination  of  the  predictable  patterns  which  reveal  semantic  relations  in  definitions, 
entails  a  labor-intensive,  rigorous,  intellectual  analysis  of  large  sets  of  definitions  for  the 
inductive  discernment  of  semantic  relations  which  are  predictably  indicated  by  the  three  levels 
of  linguistic  regularities  described  above. 

For  this  research  effort,  we  focused  our  attention  on  verbs.  Verbs  were  chosen  for  several 
reasons.  Firstly,  our  research  at  Rome  Lab  last  summer  began  the  current  effort  and  we  had 
chosen  verbs  for  that  effort,  partially  because  Michael  McHale,  a  Rome  Lab  researcher  was 
developing  a  principle-based  parser  (McHale,  1993)  which  requires  knowledge  of  the  semantic 
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role  of  nouns  for  various  verbs.  A  more  basic  reason  is  the  fact  that  verbs  are  central  to  many 
NLP  analysis  schemes,  such  as  case-frame  analysis.  Also,  much  of  the  early  work  on  better 
representation  of  dictionary  definitions  had  focused  on  nouns  and  we  wished  to  extend  this  work 
into  a  new  grammatical  category.  A  frequency  count  of  the  individual  verbs  which  occur  in  the 
verb  definitions  in  LDOCE  determined  the  set  of  verbs  on  which  we  would  focus  our  attention.  We 
analyzed  the  seventeen  verbs  with  the  highest  frequency  of  occurrence  in  LDOCE  definitions. 
They  are:  be,  become,  bring,  cause,  do,  force,  form,  give,  go,  have,  make,  move, 
piace,  put,  show,  take,  and  use.  Rather  than  analyzing  all  the  occurrences  of  each  of  these 
verbs,  we  looked  at  just  half  of  the  definitions,  reserving  the  other  half  as  a  database  against 
which  we  will  next  test  the  RRF  developed  during  this  project. 

We  developed  RRF  which  could  distinguish  the  various  senses  of  each  verb.  The  algorithms 
require  the  RRF  applicator  to  consult  the  online  LDOCE  file  to  determine  the  parts  of  speech  of 
the  words  in  the  definitions.  In  addition,  semantic  information  that  is  available  in  the  Box  Codes 
on  the  machine-readable  LDOCE  is  included  in  the  RRF  for  a  fuller  conceptual  representation  of 
the  definitions.  Fuller  detail  of  the  process  and  results  of  developing  RRF  for  the  verb  'make'  are 
available  In  my  final  report  to  RDL  In  1991,  entitled  "Development  of  a  Methodology  for 
Extracting  Semantic  Relations  from  Definitions".  Figure  3  provides  a  few  definitions  which  use 
the  verb  'make'  and  indicate  the  type  of  semantic  information  which  can  be  added  to  a  definition 
by  applying  the  RRF. 

[PAflENT]  serve  in  one  oT the  armed  forces  by  iaw 
damp  to  make  [CAUSE]  a  stringed  musical  instrument  [AFF.OBJ]  sound  less  loudly,  e.g. 
by  using  a  damper 

defuse  to  make  [CAUSE  TO  BE]  less  dangerous  [RES.ATT]  or  harmful  [RES.ATT] 

convey  to  make  [CAUSE  TO  BE]  feelings,  ideas,  thoughts,  etc.  [AFF.OBJ]  known 

[RES.ATT] 

copy  to  make  [PRODUCE]  a  copy  [RES.CONC]  of 

crackle  to  cause  to  make  [PRODUCE]  small  sharp  repeated  sounds  [RES.ABS] 

AFF.OBJ  =  Affected  Object  RES.ATT  =  Resulting  Attribute 

RES.CONC  =  Result,  Concrete  RES.ABS  =  Result,  Abstract 


Fig  3:  Definitions  with  relations  detectable  by  RRF  for  'to  make' 


Procedures 


The  RRF  were  developed  for  one  verb  at  a  time,  using  the  following  procedures.  The  verb 
definitions  were  extracted  from  LDOCE  and  Key  Word  in  Context  (KWIC)  programs  were  run  for 
each  verb's  set  of  definitions,  to  enable  us  to  look  for  similarities  in  syntax.  Any  definitions  that 
were  for  the  noun  sense  of  a  verb  (e.g.  ’cause'  is  both  a  noun  and  a  verb)  or  used  the  verb  later 
in  the  definition  and  not  as  the  defining  verb  were  eliminated.  The  intellectual  analysis  began  by 
first  grouping  definitions  based  on  shared  sense  of  the  defining  verb.  The  like  definitions  were 
then  grouped  together  based  on  similar  syntax.  This  work  was  guided  by  checking  the  senses 
found  in  LDOCE  and  other  dictionaries  to  see  the  senses  for  the  verb  being  analyzed.  This  means 
that  if  the  verb  under  analysis  was  'cause'  and  the  hundreds  of  definitions  being  analyzed  were 
those  in  which  ’cause'  occurred,  then  the  possible  senses  of  'cause'  were  checked  in  the 
dictionaries  to  serve  as  a  possible  guide  to  senses  that  might  possibly  occur  in  the  definitions 
which  used  'cause'  as  their  defining  verb.  Generally,  we  found  that  only  a  subset  of  the  possible 
senses  of  a  verb  found  in  the  dictionaries  were  actually  used  for  defining  other  verbs. 

After  determining  the  senses  of  a  verb  which  occurred  in  the  definitions,  an  initial  Relation 
Revealing  Formula  (RRF)  for  each  sense  was  developed.  If  RRF’s  were  similar  within  a  verb,  an 
attempt  was  made  to  combine  RRFs  into  one.  This  initial  breakdown  of  senses  was  done  for  each 
of  the  seventeen  verbs.  If  the  sense  developed  was  more  complex  than  the  verb  it  was  trying  to 
define,  then  further  analysis  was  done  to  see  if  another,  simpler  sense  could  be  substituted. 


Twenty  definitions  verbs  were  left  as  'questionnables'  at  the  end  because  the  definitions  did  not 
fit  into  any  one  sense.  This  is  a  relatively  very  small  proportion  of  the  total  definitions  that 
were  analyzed.  Other  verbs  had  'idiom'  senses.  For  example  the  verb  'place'  has  an  'associate' 
sense  because  the  definition  uses  'to  place  a  particular  meaning  on'.  Idioms  are  those  senses 
which  have  idiosyncratic  meanings  which  are  not  based  on  a  compositional  analysis  of  the  words 
used  in  the  definition. 

At  the  same  time  that  a  sense  was  being  selected  for  each  definition,  the  remainder  of  the 
definition  was  being  analyzed  to  determine  the  semantic  roles  played  by  other  words  and  phrases 
in  the  definitions.  In  several  instances,  the  sense  of  the  verb  was  based  on  the  semantics  and 
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syntactics  of  the  words  following  the  verb.  For  example,  in  definitions  containing  'to  be'  + 
ADV/ADJ,  the  adverb  or  adjective  conveys  the  CHARACTERISTIC  relation  and  the  sense  of  the 
verb  BE  is  EXIST. 

After  all  the  verbs  were  broken  down  by  senses  and  initial  RRF's  were  written,  a  matrix  was 
created  to  present  an  overview  model  of  how  verbs  share  senses  and  to  show  how  the  RRF  for  one 
sense  of  a  verb  differs  from  the  RRF  for  other  senses  of  the  same  verb.  The  idiom  senses  were 
separated  out  into  a  separate  matrix.  Some  verbs  took  "higher  level"  verbs  (i.e.  the  sense  is  the 
same  as  a  lexical  verb)  as  a  sense.  Also,  the  verb  'place'  has  a  PUT  sense  and  vice  versa. 

Although  this  may  be  confusing  initially,  we  could  not  find  any  way  around  it,  as  the  lexical 
verbs  which  were  used  as  senses  convey  the  meaning  of  that  sense  better  than  any  other  term  we 
could  find.  This  problem  is  similar  to  the  circularity  problem  encountered  by  lexicographers 
for  all  dictionaries.  Appendix  1  is  the  final  verb  matrix  and  presents  in  summary  form  the 
individual  verbs  that  were  analyzed  and  the  senses  for  each  verb  that  were  used  in  the 
dictionary  definitions.  Appendix  2  is  the  matrix  for  the  idiom  senses  of  the  verbs. 

The  lexical  expressions  (e.g.  EXIST,  CHANGE  PLACE,  CAUSE  TO  BE)  used  to  label  or  characterize 
the  senses  were  chosen  from  several  sources.  The  first,  and  most  often  used  source,  was  the 
verb  definitions  from  the  print  version  of  LDOCE.  Other  sources  referred  to  included  bLasK 
Webster's  Dictionary  and  the  Merriam  Webster's  Thesaurus. 

Once  a  verb  sense  was  identified  and  an  initial  Relation  Revealing  Formula  (RRF)  created,  a 
decision  was  made  whether  the  sense  was  an  idiom.  From  a  total  of  34  different  senses,  sixteen 
are  designated  as  idiomatic.  Eight  of  the  seventeen  verbs  analyzed  have  an  idiomatic  sense.  Four 
verbs  have  one  idiomatic  sense,  one  verb  has  two  idiomatic  senses,  two  verbs  have  three 
idiomatic  senses  and  one  verb  has  four  idiomatic  senses. 

In  overviewing  the  senses  found  in  this  analysis  (the  column  labels  in  the  Appendices),  the 
following  summary  interpretation  is  offered.  The  sense  EXIST  has  four  subcategories  -  EXIST  as 
a  CHARACTERISTIC,  CONDITION/POSITION,  ROLE/FUNCTION,  or  STATE.  The  difference  of  each 
subcategory  is  designated  syntactically  in  the  RRF.  For  example,  EXIST  as  a  CHARACTERISTIC  is 
followed  in  a  definition  by  either  an  ADJ  or  ADV.  For  the  subcategory  CONDITION/POSITION,  the 
part  of  speech  found  after  the  verb  is  the  preposition  'in/as/at/on'  plus  a  Noun  Phrase. 
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The  senses  CAUSE  and  CAUSE  TO  BE  were  found  to  be  the  senses  occurring  most  frequently  across 
the  seventeen  verbs,  with  seven  verbs  having  CAUSE  senses  and  five  verbs  having  CAUSE  TO  BE 
senses.  CAUSE  TO  BE  is  a  sense  for  five  verbs  when  the  definition  contains  a  specific  action  and 
outcome.  For  example,  the  CAUSE  TO  BE  formula  for  the  verb  BRING  contains  'to  bring  under/ 
into'  plus  a  NP.  The  verb  PUT  also  uses  a  similar  RRF  for  the  CAUSE  TO  BE  sense.  Four  verbs 
use  both  CAUSE  and  CAUSE  TO  BE  senses.  Two  verbs  have  two  separate  senses  of  CAUSE  which  are 
distinguished  by  their  RRF.  One  verb  has  two  separate  senses  of  CAUSE  TO  BE  also  distinguished 
by  its  RRF.  This  distinction  is  necessary  since  each  of  these  senses  comprise  the  largest  number 
of  verb  definitions. 

There  are  four  manifestations  of  a  CHANGE  sense.  First,  two  verbs  use  CHANGE  which  entails 
changing  something  or  someone  into  something  else.  Two  other  uses  of  the  underlying  semantics 
of  the  notion  of  change  are  CHANGE  PLACE  and  CHANGE  POSITION.  CHANGE  PLACE  indicates  a 
change  of  location  while  CHANGE  POSITION  indicates  an  adjustment  in  physical  position  but  the 
location  remains  the  same. 

Another  manifestation  of  the  notion  of  change  is  CAUSE  TO  CHANGE  PLACE  which  refers  to  a 
change  of  location  as  caused  by  someone  or  something.  The  verb  'move'  uses  the  three  senses 
CHANGE  PLACE,  CHANGE  POSITION,  and  CAUSE  TO  CHANGE  PLACE. 

The  verb  MOVE  also  uses  a  TRANSFER  sense,  as  do  two  other  verbs.  The  TRANSFER  sense 
involves  the  relocation  of  someone  or  something  from  one  place  to  another.  The  verb  definition 
usuaily  includes  the  place  something  was  moving  from  or  to.  The  relations  respectively  are 
LOCATION-SOURCE  and  LOCATION-GOAL 

However,  the  majority  of  senses  were  used  by  only  one  verb.  For  example,  the  senses  EFFECT, 
PROVIDE,  PUVCE,  PUT,  CAUSE  TO  POSSESS,  EXPERIENCE,  PERFORM,  PRESENT,  INDICATE, 
MUST,  PUSH,  EMPLOY,  and  REMOVE  were  only  used  once. 

Most  idiom  RRF's  included  wording  that  was  very  consistently  repeated  throughout  the 
definitions  containing  the  idiomatic  sense.  For  example,  for  the  idiomatic  sense  ASSOCIATE  for 
the  verb  PLACE  every  definition  contain  the  words  'a  particular  meaning  on'.  This  wording  then 
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became  part  of  the  RRF  and  the  sense  ASSOCIATE  is  idiomatic  for  the  verb  PLACE  since  the 
wording  is  not  found  in  any  other  definitions. 

RESULTS 

The  results  that  are  reported  herein  are  very  detailed  and  likely  of  interest  only  to  those  who 
have  been  involved  in  or  plan  to  undertake  a  similar  analysis  of  how  different  senses  of  defining 
verbs  can  be  detected  in  definition.  The  cursory  reader  is  directed  to  the  Verb-Sense  Matrix  in 
Appendix  1  and  the  Verb-Idiom  Matrix  in  Appendix  2.  In  the  next  few  pages,  our  results  will  be 
presented  for  each  verb  analyzed.  First,  the  actual  RRF  for  the  verb  will  be  presented  in  brief 
formulae,  to  be  followed  by  a  more  narrative  explanation  of  the  findings  that  supported  the 
development  of  these  RRF. 

First,  a  few  lines  of  explanation  of  how  the  various  fonts,  capitalization,  bracketing,  and 
abbreviations  are  used  in  the  following  detailed  report  of  the  results.  The  verb  being  reported 
on  occurs  in  bold,  capital  letters  and  is  followed  by  a  number  indicating  the  frequency  of 
occurrence  of  that  verb  as  the  defining  verb  in  one-half  of  the  definitions  in  LDOCE.  The 
remaining  half  of  the  definitions  are  reserved  to  provide  the  database  against  which  these  rules, 
when  automated,  will  be  tested. 

The  senses  are  in  all  caps  to  the  left  of  the  equal  sign;  parts  of  speech  are  also  in  all  caps  but 
they  appear  only  to  the  right  of  the  equal  sign  and  are  more  familiar  to  readers;  semantic 
relations  are  in  all  caps  within  square  brackets;  the  words  as  they  actually  appear  in  definitions 
in  LDOCE  are  in  single  quotes;  optional  items  are  in  curly  brackets;  alternatives  are  separated 
by  diagonals.  As  an  example,  the  first  set  of  RRF  are  for  the  verb  BE  has  six  RRF.These  are 
based  on  the  200  definitions  in  one-half  of  LDOCE  in  which  BE  served  as  the  first  and  defining 
verb.  The  first  RRF  is  for  the  sense  REPRESENT.  This  sense  can  be  recognized  in  LDOCE 
definitions  by  the  occurrence  of  the  words  'to  be'  followed  by  one  of  the  following  expressions:  'a 
sign'  or  'a  picture'  or  'a  symbol'.  Each  of  these  three  expressions  may  be  followed  by  either  the 
word  'that'  or  the  word  'of.  In  all  instances,  the  next  item  in  the  definition  will  be  a  NP  (Noun 
Phrase)  which  exists  in  the  semantic  relation  to  the  word  being  defined  as  an  [ABSTRACT 
OBJECT].  Optionally,  the  NP  will  be  followed  by  a  PP  (Prepositional  Phrase). 
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BE  (defining  verb  in  200  definitions) 


REPRESENT  =  'to  be'  +  'a  sign/picture/symbol  {that/of}'  +  NP  [ABSTRACT  OBJECT]  +  {PP} 

EXIST  =  'to  be'  +  ADJ/ADV  [CHARACTERISTIC]  +  {'to'  +  VB  [ACTION]}  +  {NP  [OBJECT/  PATIENT]}  + 
{PP} 

EXIST  =  'to  be'  +  ADJ/ADV  [CHARACTERISTIC]  +  {PP} 

EXIST  =  'to  be'  +  PAST  PARTICIPLE  [STATE]  +  {PP/ADV} 

EXIST  =  'to  be'  +  'in/as/at/on'  +  NP  [CONDITION/POSITION] 

EXIST  =  'to  be’  +  NP  [ROLE/FUNCTION]  +  [P.P.] 

The  verb  BE  occurred  in  200  definitions  and  conveys  two  basic  senses  in  ali  its  occurrences, 
one  of  which  is  idiomatic.  Of  the  total  number  of  occurrences,  only  13  are  idiomatic.  When  'to 
be'  plus  the  words  'sign/picture/symbol  are  used  the  verb  being  defined  conveys  the  idiomatic 
sense  of  REPRESENT.  For  example,  the  word  'betray'  is  defined  as  'to  be  a  sign  of  [REPRESENT] 
'something  one  would  like  to  hide'. 


The  remaining  definitions  convey  the  sense  of  the  verb  BE  as  EXIST  which  is  itself  sub¬ 
categorized  into  four  senses  and  detectable  by  five  RRF.  The  RRF  are  separable  by  the  different 
parts  of  speech  following  'to  be.'  If  it  is  an  ADJ  or  ADV,  then  the  sense  is  EXIST- 
CHARACTERISTICj.  For  example,  the  word  'linger'  is  defined  as  'to  be  [EXIST]  slow 
[CHARACTERISTIC]  to  disappear.'  Another  example  is  the  word  'suffice'  which  is  defined  as  'to  be 
[EXIST]  enough  [CHARACTERISTIC]'.  There  are  two  RRF's  for  the  EXIST-CHARACTERISTIC  sense, 
since  some  definitions  also  contain  a  VB  [ACTION]  and  NP  [OBJECT/PATIENT]. 

If  the  definition  contains  a  PAST  PARTICIPLE,  then  the  sense  is  'exist'  [STATE].  For  example,  the 
word  'cook'  is  defined  as  'to  be  [EXIST]  prepared  [STATE]  in  this  way  [MANNER].  As  another 
example,  the  word  'sell'  is  defined  as  'to  be  [EXIST]  bought  [STATE].' 

If  the  definition  contains  one  of  the  following  PREPOSITIONS  'in/as/at/on',  plus  a  NP,  then  the 
sense  is  'exist'  [CONDITION].  For  example,  the  word  'conflict'  is  defined  as  'to  be  [EXIST]  in 
opposition  [CONDITION].' 

If  the  definition  contains  a  NP,  then  the  sense  is  'exist'  [ROLE/FUNCTION].  For  example,  the 
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word  'preside'  is  defined  as  'to  be  [EXIST]  in  charge  of  [FUNCTION].'  Whereas,  the  word  reign 
is  defined  as  'to  be  [EXIST]  the  king  or  queen  [ROLE].'  The  NP  representing  a  role  can  also 
identify  a  person. 

BECOME  (defining  verb  in  46  definitions) 

CHANGE  =  'to  become' 

CHANGE  =  'to  become'  +  NOUN  [I.D.]  +  {PP} 

CAUSE  TO  BE  =  'to  become'  +  {ADV}  +  ADJ  [CHARACTERISTIC]  +  {ADV/PP} 

CAUSE  TO  BE  =  'to  become'  +  (ADV)  +  PAST  PARTICIPLE  [STATE]  +  {NP}  +  (PP] 

The  verb  BECOME  has  two  senses,  with  each  sense  having  two  RRFs.  The  first  sense  is  CHANGE. 
CHANGE'S  first  RRF  consists  simply  of  the  phrase  'to  become.'  For  example,  the  word  'get'  is 
defined  as  'to  become  [CHANGE].'  The  second  RRF  for  the  sense  CHANGE  contains  a  NP  labeled  as 
an  I.D.  For  example,  the  word  'father'  is  defined  as  '{of  a  man)  to  become  [CHANGE]  the  father 
[I.D.]  of.' 

The  base  sense  for  the  verb  BECOME  is  'cause  to  be.'  Its  two  RRF's  are  similar  in  structure, 
except  one  contains  an  ADJ  which  indicates  the  presence  in  the  definition  of  a  CHARACTERISTIC 
and  the  PAST  PARTICIPLE  reveals  a  STATE  relation.  For  example,  the  first  RRF  occurs  in  the 
definition  for  'languish',  which  is  defined  as  'to  become  [CAUSE  TO  BE]  weak  [CHARACTERISTIC], 
or  unhappy  [CHARACTERISTIC]  through  desire  [MEANS].'  In  the  second  RRF,  the  past  participle 
is  labeled  a  STATE.  For  example,  the  word  'crumble'  is  defined  as  'become  [CAUSE  TO  BE]  ruined 
[STATE].' 

BRING  (defining  verb  in  55  definitions) 

CHANGE  PLACE  =  'to  bring'  +  {NP  [OBJ]}  +  ADV  [DIRECTION/LOCATION]  +  ... 

TRANSFER  =  'to  bring'  +  {NP  [CONC.OBJ]}  +  'from'  +  {NP  [LOC-SOURCE]}  +  {'to'}  +  {NP  [LOC-GOAL]} 
CAUSE  =  'to  bring'  +  {NP  [OBJ]}  +  'to'  +  NP  [STATE]  +  {PP/ADV} 

CAUSE  =  'to  bring'  +  NP  [STATE]  +  ’to/on/in'  +  {NP  (PATIENT)} 

CAUSE  TO  BE  =  'to  bring'  +  'under/into'  +  NP  [STATE]  +  {PP} 

RETURN  =  'to  bring'  +  {NP  [PATIENT]}  +  'back'  +  {(NP)  [OBJ]}  +  {'to/into'}  +  {NP  [STATE/LOC-GOAL]} 


7-13 


The  verb  BRING  has  five  senses,  one  of  which  is  idiomatic.  Three  of  the  RRFs  are  quite 
similar,  in  that  the  senses  they  indicate  are  themselves  quite  similar  and  basic  variants  of  the 
base  sense  CHANGE  PLACE.  The  RRF  for  the  base  sense  differs  in  that  it  denotes  a  direction  or 
location.  For  example,  the  word  'puli'  is  defined  as  'to  bring  [CHANGE  PLACE]  something 
[OBJECT]  along  behind  one  [DIRECTION]  while  moving  [MANNER].' 

A  similar  RRF  is  for  the  sense  TRANSFER,  but  instead  of  DIRECTION,  the  RRF  contains  a 
LOCATION-SOURCE  and  a  LOCATION-GOAL  relation.  For  example,  the  word  'get'  is  defined  as  'to 
bring  [TRANSFER]  from  one  place  [LOC-SOURCE]  to  another  [LOC-GOAL].' 

Another  sense  for  the  verb  BRING  is  CAUSE  and  is  represented  by  two  RRFs.  The  first  RRF 
signifies  bringing  an  OBJECT  to  a  STATE.  And  the  second  RRF  signifies  bringing  a  STATE 
'to/on/in'  a  PATIENT.  For  example,  the  word  'promote'  is  defined  as  'to  bring  [CAUSE]  goods 
[OBJECT]  to  public  notice  [STATE]  in  order  to  encourage  people  to  buy  [REASON].’  An  example 
for  the  second  RRF  is  the  word  'cure'  which  is  defined  as  'to  bring  [CAUSE]  health  [STATE]  to  a 
person  [PATIENT]  in  place  of  disease  or  illness  [CONCESSION].' 

A  similar  sense  for  the  verb  BRING  Is  CAUSE  TO  BE.  The  NP  in  the  RRF  is  also  labeled  a  STATE. 
For  example,  the  word  'disrupt'  is  defined  as  'to  bring  [CAUSE  TO  BE]  into  disorder  [STATE].' 

The  idiomatic  sense  for  the  verb  BRING  is  RETURN  and  is  represented  in  the  RRF  by  including 
the  word  'back'  in  each  definition.  For  example,  the  word  "restore"  is  defined  as  'to  bring  back 
[RETURN]  into  use  or  existence  [STATE].’  This  sense  is  idiomatic  since  the  locational  sense  of 
back  is  not  at  ail  what  is  intended  in  this  definition. 

CAUSE  (defining  verb  in  377  definitions) 

EFFECT  STATE  =  'to  cause  to'  +  [NP  [OBJECT/PATIENT]}  +  {ADV}  +  VERB  [STATE]  +  (NP  [ABSTRACT 
OBJECT]}  +  [ADV/PP] 

EFFECT  ACT  =  'to  cause  to'  +  (NP  [OBJECT/PATIENT]}  +  VERB  [ACT]  +  (NP  [CONCRETE  OBJECT]}  + 
{ADJ/ADV/PP} 

PRODUCE  =  'to  cause'  +  NP  [OBJECT]  +  'on/to' 
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The  verb  CAUSE  can  covey  two  basic  senses,  but  one  of  these  is  sub-categorized  further,  so 
there  are  three  RRFs  as  a  resuit.  The  base  sense  is  to  EFFECT  but  is  differentiated  by  the 
definitions  either  effecting  a  STATE  or  an  ACT.  For  example,  the  word  'render*  is  defined  as  'to 
cause  to  [EFFECT]  be  [STATE].'  Or  the  word  'persuade'  is  defined  as  'to  cause  to  [EFFECT]  believe 
[STATE]  or  feel  certain  [STATE].' 

The  RRF  for  EFFECT-ACT  is  similar  syntactically  but  the  definitions  containing  this  RRF  are 
distinctly  different  in  meaning  conveyed.  For  example,  the  word  'vaporize'  is  defined  as  'to  cause 
to  [EFFECT]  change  [ACTION]  into  vapour  [CONCRETE  OBJECT].'  Or  the  word  'rain'  is  defined  as 
'to  cause  to  [EFFECT]  fall  [ACTION]  like  rain  [CONCRETE  OBJECT].' 

The  second  sense  for  the  verb  CAUSE  is  PRODUCE  and  occurs  in  only  two  definitions.  An 
example  is  the  word  'bruise'  which  is  defined  as  '  to  cause  [PRODUCE]  a  bruise  [ABSTRACT 
OBJECT]  on.' 

DO  (defining  verb  in  34  definitions) 

PERFORM  =  'to  do'  +  [ADV]  +  (NP  [ACTIVITY]}  +  [ADV/PP] 

The  verb  DO  occurs  in  only  34  definitions  and  its  uses  in  definitions  always  convey  the  sense 
PERFORM.  Since  'to  do'  denotes  an  action,  an  action  word  was  chosen  to  for  the  sense  as  well.  As 
an  example,  the  word  'experiment'  is  defined  as  'to  do  [PERFORM]  an  experiment  [ACTIVITY].' 

FORCE  (defining  verb  in  22  definitions) 

CAUSE  =  'to  force'  +  [NP  [PATIENT]}  +  {ADV}  +  ("to-  +  VERB  [ACTION]}  +  (NP  [RESULT]}  +  {PP/ADV} 

CAUSE  =  'to  force'  +  (NP  [(OBJ)}  +  {ADV}  +  {'into'}  +  {NP  [RESULT]}  +  {PP/ADV} 

PUSH  =  'to  force'  +  {(NP)  [PATIENT]}  +  [ADV  [MANNER]}  +  [NP  [OBJ]}  +  {PP} 

The  verb  FORCE  can  convey  two  senses  CAUSE  and  PUSH  with  the  majority  of  the  definitions 
conveying  the  CAUSE  sense.  Also,  there  are  two  RRFs  for  the  CAUSE  sense.  One  RRF  contains  an 
optional  VERB  and  includes  a  PATIENT  performing  an  ACTION  or  an  ACTION  causing  a  RESULT. 
Even  though  the  word  FORCE  seems  have  a  connotation  of  action,  the  basic  sense  of  the  word  is 
CAUSE  For  example,  the  word  'frogmarch'  is  defined  'to  force  [CAUSE]  a  person  [PATIENT]  to 
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move  [ACTION]  forward  [DIRECTION]  with  the  arms  held  together  firmly  from  behind 
[MANNER].'  And  the  word  'muzzle'  is  defined  as  'to  force  [CAUSE]  to  keep  [ACTION]  silent 
[RESULT].' 

The  second  RRF  for  the  CAUSE  sense  of  FORCE  entails  an  OBJECT  forced  into  a  RESULT.  For 
example,  the  word  'compress'  is  defined  as  'to  force  [CAUSE]  a  substance  [OBJECT]  into  less 
space  [RESULT]'  or  'squash'  is  defined  as  'to  force  [CAUSE]  into  a  fiat  shape  [RESULT].' 

The  other  sense  of  FORCE  is  PUSH,  which  conveys  the  manner  in  which  the  FORCE  occurs.  For 
example,  'dislodge'  is  defined  as  'to  force  [PUSH]  out  [MANNER]  of  a  position  [LOCATION- 
SOURCE]'  or  'overtax'  is  defined  as  'to  force  [PUSH]  beyond  [MANNER]  a  limit  [RESULT].' 

FORM  (defining  verb  in  26  definitions) 

PRODUCE  =  'to  form'  +  {'into'}  +  (NP  [RESULT]}  +  {ADV/PP}  +  {'by'  [MEANS]} 

CAUSE  =  'to  form'  +  NP  [PATIENT]  +  VERB  [ACTION]  +  NP  [RESULT] 

The  verb  FORM  has  two  senses,  PRODUCE  and  CAUSE,  but  the  basic  sense  is  PRODUCE.  The  sense 
CAUSE  occurs  in  but  one  definition  and  is  the  only  definition  containing  the  verb  FORM  with  a 
person  in  the  definition.  The  word  'dress'  is  defined  as  'to  form  [CAUSE]  soldiers  [PATIENT]  to 
form  [ACTION]  a  straight  line  [RESULT].'  The  RRF  for  the  sense  PRODUCE  signifies  there  is  a 
RESULT  from  the  action  FORM.  For  example,  the  word  'stack'  is  defined  as  'to  form  [PRODUCE] 
into  a  neat  pile  [RESULT].'  Or  the  definition  may  not  contain  the  specific  RESULT  but  tells  how 
the  implied  result  is  accomplished  via  the  MEANS  relation.  For  example,  the  word  forge  is 
defined  as  'to  form  [PRODUCE]  by  heating  and  hammering  [MEANS].' 

GIVE  (defining  verb  in  155  definitions) 

CAUSE  [AN  EFFECT]  =  'to  give'  +  NP  [EFFECT]  +  {ADV/PP} 

PROVIDE  =  'to  give'  +  {NP  [RECIPIENT]}  +  {VERB}  +  {NP  [OBJECT]}  +  {PP/ADV} 

PROVIDE  =  'to  give'  +  NP  [ABSTRACT  OBJ]  +  {'to'}  +  NP  [RECIPIENT]  +  {PP} 

TRANSFER  =  'to  give'  +  {NP  [CONCRETE  OBJECT]}  +  {NP  [RECIPIENT]}  +  {PP} 

EMIT  =  'to  give'  +  'out'  +  {NP  [ABSTRACT  OBJECT]}  +  {PP} 
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FORGO  =  'to  give'  +  'up'  +  {ADV}  +  {(NP)} 

BIRTH  =  'to  give'  +  'birth"  +  {ADV}  +  NP  [PATIENT] 

The  verb  GIVE  has  six  different  senses,  three  of  which  are  idiomatic.  The  three  base  senses  are 
CAUSE,  PROVIDE,  and  TRANSFER.  The  three  idiomatic  senses  are  EMIT,  FORGO,  and  BIRTH.  The 
base  sense  of  the  verb  GIVE  is  PROVIDE  which  is  revealed  by  two  RRFs.  The  first  RRF  signifies 
someone  [RECIPIENT]  is  being  provided  with  something  [OBJECT].  For  example,  the  word 
'promote'  is  defined  as  'to  give  [PROVIDE]  someone  [RECIPIENT]  a  higher  position  or  rank 
[OBJECT].'  The  other  RRF  signifies  something  [OBJECT]  is  being  given  to  someone  [RECIPIENT]. 
In  many  definitions  the  recipient  is  implied.  For  example,  the  word  'inform'  is  defined  as  to 
give  [PROVIDE]  information  or  knowledge  [OBJECT]  to.'  The  word  'shelter'  is  defined  as  'to  give 
[PROVIDE]  shelter  [OBJECT]  to.'  The  OBJECT  can  be  either  abstract  or  concrete. 

Another  sense  of  GIVE  is  TRANSFER  and  its  meaning  as  well  as  its  RRF  differs  from  PROVIDE 
because  the  object  being  given  to  the  RECIPIENT  is  CONCRETE  and  sometimes  includes  a 
LOCATION-SOURCE  and  LOCATION-GOAL.  For  example,  the  word  'hand'  is  defined  as  'to  give 
[TRANSFER]  from  one's  own  hand  [LOC-SOURCE]  into  someone  else's  [LOC-GOAL].'  The  word 
'present'  is  defined  as  'to  give  [TRANSFER]  something  [OBJECT]  away.'  The  TRANSFER  sense 
contains  an  implied  movement  of  an  object. 

The  last  sense  of  GIVE  Is  CAUSE.  The  definitions  show  that  'to  give'  an  ABSTRACT  OBJECT  can  be 
seen  as  to  CAUSE  AN  EFFECT.  For  example,  the  word  'tint'  is  defined  as  'to  give  [CAUSE]  a  slight 
or  delicate  color  [EFFECT]  to.'  The  word  'feel'  is  defined  as  'to  give  [CAUSE]  or  produce  the  stated 
sensation  [EFFECT].' 

As  for  the  idiomatic  senses  of  GIVE,  the  RRF  for  EMIT  contains  the  idiomatic  phrase  to  give  out. 
For  example,  the  word  'glow'  is  defined  as  'to  give  out  [EMIT]  heat  [OBJECT]  and/or  soft  light 
[OBJECT]  without  flames  or  smoke  [MANNER].'  The  next  sense  FORGO  is  the  largest  of  the 
idiomatic  groups  and  the  RRF  contains  the  phrase  'to  give  up.'  For  example,  the  word  'yield'  is 
defined  as  'to  give  up  [FORGO]  control  [OBJECT]  of.'  The  last  idiomatic  sense  BIRTH  has  only 
three  definitions.  The  RRF  contains  the  phrase  'to  give  birth.'  For  example,  the  word  'foal'  is 
defined  as  'to  give  birth  [BIRTH]  to  a  foal  [PATIENT].' 


GO  (defining  verb  in  73  definitions) 

CHANGE  PLACE  =  'to  go’  +  {NP  [OBJ]}  +  {ADV  [DIRECTION]}  +  {PP}  +  (NP  [OBJ/PATIENT/  LOC-GOAL]} 
DEPART  =  'to  go  away'  +  {ADV  [MANNER]}  +  {PP} 

DEFY  =  'to  go  against'  +  NP  [OBJ] 

EXCEED  =  'to  go'  +  ADV  [DEGREE]  +  {NP  [ABST-OBJ]} 

The  verb  GO  has  four  distinct  senses,  three  of  which  are  idiomatic.  The  base  sense  for  the  verb 
GO  is  CHANGE  PLACE  and  has  a  complex  RRF,  with  many  variations.  For  exampie,  the  word 
'sneak'  is  defined  as  'to  go  [CHANGE  PLACE]  quietly  and  secretly,  [MANNER]  so  as  not  to  be  seen 
[PURPOSE].'  Or  the  word  'decline'  is  defined  as  'to  go  [CHANGE  PLACE]  from  a  better  [LOC- 
SOURCE]  to  a  worse  position  [LOC-GOAL].' 

The  first  idiomatic  sense  of  GO  is  DEPART  and  is  represented  in  the  RRF  by  the  words  to  go 
away.'  For  example,  the  word  'begone'  is  defined  as  'to  go  away  [DEPART]  at  once  [MANNER]. 

The  second  idiomatic  sense  is  DEFY  and  is  represented  in  the  RRF  by  the  words  'to  go  against.' 

For  example,  the  word  'disoblige'  is  defined  as  'to  go  against  [DEFY]  the  wishes  [OBJECT]  of.'  The 
third  idiomatic  sense  is  EXCEED  and  contains  an  ADV  labeled  as  a  DEGREE  in  the  RRF.  For 
example,  the  word  'overflow'  is  defined  as  'to  go  (EXCEED]  beyond  [DEGREE]  the  limits 
[ABSTRACT  OBJECT]  of.' 

HAVE  (defining  verb  in  63  definitions) 

POSSESS  =  'to  have'  +  {NP  [OBJECT]}  +  {PP} 

CAUSE  =  'to  have'  +  NP  [EFFECT]  +  {'on'}  +  {NP  [PATIENT/OBJECT]}  +  {PP} 

EXPERIENCE  =  'to  have'  +  NP  [ABST  OBJ]  +  {NP  [PATIENT]}  +  {PP} 

MUST  =  'to  have  to'  +  {VERB  [ACTION]}  +  {P.P.} 

CAUSE  TO  BE  =  'to  have'  +  PAST  PARTICIPLE  [ACTION]  +  {PP}  +  {NP} 

The  verb  HAVE  has  five  senses,  one  of  which  is  idiomatic.  The  base  sense  is  POSSESS  and  is 
signified  in  the  RRF  by  the  pattern  'to  have'  +  OBJECT.  For  example,  the  word  'merit'  is 
defined  as  'have  [POSSESS]  a  right  to  [OBJECT].'  Or,  the  word  'suit'  is  defined  as  'to  have 
[POSSESS]  the  right  qualities  [OBJECT].' 
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Another  sense  for  the  verb  HAVE  is  CAUSE.  When  this  sense  is  intended,  'have'  is  followed  in  the 
definition  by  a  NP  representing  an  EFFECT.  For  example,  the  word  'impact'  is  defined  as  'to  have 
[CAUSE]  an  impact  [EFFECT]  on.  A  similar  sense  is  CAUSE  TO  BE  but  is  different  because  the 
definition  contains  a  PAST  PARTICIPLE  which  is  labeled  as  an  ACTION.  For  example,  the  word 
'deserve'  is  defined  as  'to  have  [CAUSE  TO  BE]  earned  [ACTION]  by  one's  actions  [MANNER]. 

The  second  largest  group  of  definitions  containing  HAVE  convey  the  sense  EXPERIENCE.  The 
definitions  are  different  from  the  previous  senses  because  the  word  being  defined  conveys  more 
of  an  action  than  a  state.  For  example,  the  word  'ache'  is  defined  as  'to  have  [EXPERIENCE]  a 
continuous,  but  not  violent  pain  [OBJECT].'  Or,  the  word  'violate'  is  defined  as  'to  have 
[EXPERIENCE]  sex  [OBJECT]  with  a  woman  [PATIENT]  by  force  [MEANS].' 


The  idiomatic  sense  of  HAVE  is  MUST  and  is  distinguished  in  the  RRF  by  having  'to  have  to’  in  the 
definition.  For  example,  the  word  'owe'  is  defined  as  'to  have  to  [MUST]  pay  [ACTION]  for 
something  already  done  [PURPOSE].' 

MAKE  (defining  verb  in  426  definitions) 

PRODUCE  =  'to  make'  +  NP  [ABSTRACT  OBJECT/CONCRETE  OBJECT] 

CAUSE  =  'to  make'  +  NP  +  VERB 
CAUSE  TO  BE  =  'to  make'  +  NP  +  ADJ 

The  verb  MAKE  conveys  three  distinct  senses  and  three  distinct  RRFs.  The  base  sense  of  the 
verb  is  PRODUCE  and  can  be  the  production  of  either  a  CONCRETE  OBJECT  or  an  ABSTRACT 
OBJECT.  For  example,  the  word  'draw'  is  defined  as  'to  make  [PRODUCE]  pictures  [CONCRETE 
OBJECT]  with  a  pencil  or  pen  [INSTRUMENT.]'  Or  the  word  'grin'  is  defined  as  'to  make 
[PRODUCE]  a  wide  smile  [ABSTRACT  OBJECT].' 

The  second  sense  for  the  verb  MAKE  is  CAUSE  and  includes  a  second  VERB  in  each  of  the 
definitions.  For  example,  the  word  'enforce'  is  defined  as  'to  make  [CAUSE]  something  [OBJECT] 
happen  [ACTION]  esp.  by  threats  or  force  [MEANS].'  Or  the  word  'shape'  is  defined  as  'to  make 
[CAUSE]  a  piece  of  clothing  [CONCRETE  OBJECT]  fit  the  body  [ACTION]  closely  [MANNER].' 

The  third  sense  for  the  verb  MAKE  is  CAUSE  TO  BE  and  contains  an  ADJ  or  PAST  PARTICIPLE  to 
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distinguish  the  RRF.  For  exampie,  the  word  'enable'  is  defined  as  'to  make  [CAUSE  TO  BE] 
possible  [RESULTING  ATTRIBUTE].'  Or  the  word,  'declare'  is  defined  as  'to  make  [CAUSE  TO  BE] 
known  [RESULTING  ATTRIBUTE]  publicly  or  officially  [MANNER].' 

MOVE  (defining  verb  in  117  definitions) 

CHANGE  POSITION  =  'to  move'  +  (NP  [OBJECT/(PATIENT)]}  +  {ADV/PP  [MANNER]} 

CAUSE  TO  CHANGE  PLACE  =  'to  move'  +  (NP  [OBJ/PATIENT]}  +  {ADV}  +  PP 
TRANSFER  =  'to  move'  +  (NP  [OBJECT]}  +  {ADV}  +  PP  [LOC-SOURCE/LOC-GOAL] 

CHANGE  PLACE  =  'to  move'  +  ADV  [DIRECTION]  +  [NP  [LOC-GOAL]}  +  .. 

The  verb  MOVE  is  represented  by  four  quite  similar  senses:  CHANGE  POSITION,  CAUSE  TO 
CHANGE  PLACE,  TRANSFER,  and  CHANGE  PU\CE.  The  largest  group  of  definitions  is  represented 
by  the  sense  CHANGE  POSITION.  For  example,  the  word  'wiggle'  is  defined  as  'to  move  [CHANGE 
POSITION]  in  small  side-to-side,  up-and-down,  or  turning  movements  [MANNER].'  Or  the  word 
'rocket'  is  defined  as  'to  move  [CHANGE  POSITION]  at  very  great  speed  [MANNER]. 

The  next  largest  group  of  definitions  containing  MOVE  is  represented  by  the  sense  CAUSE  TO 
CHANGE  PLACE.  For  example,  the  word  'occupy'  is  defined  as  'to  move  [CAUSE  TO  CHANGE  PLACE] 
into  and  hold  possession  of  (a  place),  [OBJ]  e.g.  by  military  force  [MEANS].' 

The  other  two  senses,  TRANSFER  and  CHANGE  PLACE  represent  approximately  the  same  number 
of  definitions.  For  the  sense  TRANSFER,  the  word  'jump'  is  defined  as  'to  move  [TRANSFER] 
suddenly  [MANNER]  from  one  point  [LOC-SOURCE]  to  another,  [LOC-GOAL]  often  missing  out 
what  comes  in  between  [RESULT].  And,  for  the  sense  CHANGE  PLACE,  the  word  'recede  is  defined 
as  '(of  a  thing)  to  move  [CHANGE  PLACE]  back  or  away  [DIRECTION].' 

PLACE  (defining  verb  in  13  definitions) 

PUT  =  'place'  +  (NOUN  [PATIENT/OBJ]  +  {ADV/PP} 

GIVE  =  'to  place'  +  Noun  Phrase  [CONCRETE  OBJECT] 

ASSOCIATE  =  'to  place'  +  'a  particular  meaning  on' 

The  verb  PLACE  has  three  senses,  one  of  which  is  idiomatic.  All  three  senses  have  a  small  group 
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of  definitions,  with  the  base  sense  being  PUT.  For  example,  the  word  'settle'  is  defined  as  'to 
place  [PUT]  (someone  or  oneself)  [PATIENT]  so  as  to  be  comfortable  [PURPOSE].'  Most 
definitions  for  the  sense  PUT  have  a  PATIENT  following  the  verb. 

Another  sense  of  PLACE  is  GIVE  and  its  RRF  contains  the  VERB  plus  a  CONCRETE  OBJECT.  For 
example,  the  word  'commission'  is  defined  as  'to  place  [GIVE]  a  special  order  [CONC  OBJ]  for,  or 
appoint  someone  to  do  (something).' 

The  idiomatic  sense  for  the  verb  PLACE  is  ASSOCIATE  and  its  RRF  contains  the  words  'a 
particular  meaning  on.'  For  example,  the  word  'interpret'  is  defined  as  'place  [ASSOCIATE]  a 
particular  meaning  on.' 

PUT  (defining  verb  in  143  definitions) 

PLACE  =  'to  put'  +  {NP  [OBJ]}  +  'under/in/into/through/onto'  +  NP  [OBJECT/LOC-GOAL] 

CHANGE  =  'to  put'  +  (NP)  [ABSTRACT  OBJECT]  +  {ADV}  +  'into'  +  NP  [RESULT] 

CAUSE  =  'to  put'  +  (ADV  [DEGREE]}  +  NP  [CONDITION]  +  'to/on'  +  .. 

CAUSE  TO  BE  =  'to  put'  +  {(NP)  [PATIENT/OBJ]}  +  'into/under/  out/in'  +  NP  [CONDITION] 

COVER  =  to  put'  +  'on'  +  NP  [OBJ} 

COVER  =  'to  put’  +  NP  [OBJ]  +  'on/over/onto*  +  (NP  [LOC-GOAL]}  +  {...} 

COMBINE  =  'to  put'  +  'together'  +  {...} 

The  verb  PUT  has  six  senses,  two  of  which  are  idiomatic.  Just  as  the  base  sense  for  PLACE  is 
PUT,  the  base  sense  for  PUT  is  PLACE.  The  definitions  for  the  PLACE  sense  usually  contain  the 
preposition  of  'under/in/into/through/onto'  as  indicated  in  the  RRF.  For  example,  the  word 
'immerse'  is  defined  as  'to  put  [PLACE]  deep  into  a  body  of  liquid  [LOC-GOAL].'  Or,  the  word 
'blockade'  is  defined  as  'to  put  [PLACE]  under  a  blockade  [LOC-GOAL].' 

The  second  sense  of  PUT  is  CHANGE.  The  definitions  which  convey  the  CHANGE  sense  all  contain 
an  ABSTRACT  OBJECT  which  is  represented  in  the  RRF.  For  example,  the  word  'encode'  is  defined 
as  1o  put  [CHANGE]  a  message  [ABSTRACT  OBJECT]  into  a  code  [RESULT].' 

Two  other  senses  of  PUT,  CAUSE  and  CAUSE  TO  BE  are  somewhat  similar  but  have  two  distinct 


7-21 


RRFs.  Both  senses  may  contain  a  CONDITION,  but  definitions  for  CAUSE  TO  BE  may  have  two  NFS. 
For  example,  the  word  'disgrace'  is  defined  as  'to  put  [CAUSE  TO  BE]  a  public  person  [PATIENT] 
out  of  favour  [CONDITION].'  Or,  the  word  'restore'  is  defined  as  'to  put  [CAUSE  TO  BE]  back  into 
a  former  position  [CONDITION].'  The  CAUSE  sense  may  have  an  ADV  signifying  a  DEGREE.  For 
example,  the  word  'overtax'  is  defined  as  'to  put  [CAUSE]  too  great  [DEGREE]  a  tax  [CONDITION] 
on  goods  [OBJECT].' 

The  first  idiomatic  sense  of  PUT  is  COVER  and  is  detected  in  definitions  by  two  semantically 
very  similar  RRFs.  The  first  RRF  signifies  putting  on  something;  and  the  second  RRF  signifies 
putting  something  'on/over/onto'  something.  For  example,  the  word  'dress'  is  defined  as  'to  put 
[COVER]  on  formal  clothes  [OBJECT]  for  the  evening  [REASON].'  And  the  word  'plaster'  is 
defined  as  'to  put  [COVER]  wet  plaster  [OBJECT]  on.' 

The  second  idiomatic  sense  of  PUT  is  COMBINE  and  is  indicated  by  the  word  together  in 
definitions.  For  example,  the  word  'string'  is  defined  as  'to  put  [COMBINE]  together  onto  a  thread 
[OBJECT]  so  as  to  form  a  string  [PURPOSE].' 

SHOW  (defining  verb  in  40  definitions) 

INDICATE  =  'to  show'  +  {ADV}  +  {NP  [OBJ/PATIENT]}  +  {PP} 

PRESENT  =  'to  show'  +  (NP)  +  ADV 

The  verb  SHOW  has  two  senses,  with  one  sense  occurring  only  one  definition.  The  base  sense  is 
INDICATE.  For  example,  the  word  'chart'  is  defined  as  'show  [INDICATE]  on  a  chart  [LOC-GOAL].' 
Or,  the  word  'relate'  is  defined  as  'to  show  [INDICATE]  a  connection  between  [RELATIONSHIP. 

The  other  sense  is  PRESENT  and  the  example  word  defined  for  this  sense  is  'rerun',  'to  show 
[PRESENT]  a  film  or  recorded  broadcast  [OBJECT]  again  [FREQUENCY].' 

TAKE  (defining  verb  in  90  definitions) 

POSSESS=  'to  take'  +  {NP  [ABSTRACT  OBJECT/PATIENT]}  +  {PP} 

CAUSE  TO  POSSESS  =  'to  take'  +  {NP  [OBJECT/PATIENT]}  +  {PP/ADV} 

REMOVE  =  'to  take'  +  {NP  [OBJ]}  "away/off/out/from/back"  +  {NP  [OBJECT/PATIENT]}  +  {PP/ADV} 
PARTICIPATE  =  'take  part  in'  +  Noun  Phrase  [ABSTRACT  OBJECT]  +  {P.P.} 
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INGEST  =  'to  take'  +  {NP  [OBJ]}  +  'in/into/through'  +  {NP  [LOG-GOAL]}  +  [PP] 

CHOOSE  =  'to  take'  +  NP  [OBJ] 

GRASP  =  'to  take  hold"  +  (ABST  OBJ]  +  (ADV/PP) 

The  verb  TAKE  can  convey  seven  senses,  four  of  which  are  idiomatic.  The  POSSESS  sense  is  the 
simplest  and  most  frequent.  In  the  definitions  conveying  the  POSSESS  sense,  'to  take'  is  most 
usually,  but  not  always  followed  by  a  NP  which  is  the  ABSTRACT  OBJECT  or  PATIENT  of  the 
POSSESS  sense.  For  example,  the  verb  'conquer'  conveys  the  POSSESS  sense  via  the  definition  of 
'to  take'  (land)[OBJ]  by  force  [MEANS]. 

The  CAUSE  TO  POSSESS  sense  of  TAKE  covers  those  definition  senses  in  which  the  POSSESS  act 
is  a  bit  more  removed  as  in  the  definition  of  'colour'  where  'to  take  on  [CAUSE  TO  POSSESS] 
takes  as  its  OBJ,  'colour'. 

The  sense  of  the  verb  TAKE  which  is  REMOVE  occurs  in  definitions  which  can  contain  one  of 
several  different  prepositions.  For  example,  the  word  'deduct'  is  defined  as  'to  take  away 
[REMOVE]  an  amount  [OBJECT]  from  a  total  [LOC-SOURCE].'  Or  the  word  'undress'  is  defined  as 
'to  take  [REMOVE]  the  clothes  [OBJECT]  off  someone  [PATIENT].' 

Three  of  the  idiomatic  senses  of  TAKE  contain  specific  wording  in  their  RRF.  They  include: 
PARTICIPATE,  INGEST,  CHOOSE  and  GRASP. 

The  first  idiomatic  sense  PARTICIPATE  includes  the  phrase  'take  part  in'  in  each  definition.  For 
example,  the  word  'join'  is  defined  as  'to  take  part  in  [PARTICIPATE]  an  activity  [ABSTRACT 
OBJECT]  as  a  member  of  a  group  [MANNER].' 

The  second  idiomatic  sense  of  TAKE,  INGEST  includes  one  of  the  prepositions  'in/into/through 
in  the  definitions.  For  example,  the  word  'inhale'  is  defined  as  'to  take  [INGEST]  cigarette  smoke 
[OBJECT]  into  the  lungs  [LOC-GOAL].'  Or  the  word  'sniff'  is  defined  as  'to  take  [INGEST]  a 
harmful  drug  [OBJECT]  through  the  nose  [LOC-PATH].' 

The  third  idiomatic  sense  CHOOSE  does  not  contain  any  specific  wording  in  the  RRF,  but  is 


7-23 


dsfinitely  idiomatic.  For  example,  the  word  'fork'  is  defined  as  'of  a  person  [AGENT]  to  take 
[CHOOSE]  one  fork  of  a  road  [OBJECT].' 

The  fourth  idiomatic  sense  GRASP  contains  the  words  'to  take  hold  in  some  format  in  the 
definitions.  For  example,  the  word  'grip'  is  defined  as  'to  take  hold  [GRASP]  of  the  attention 
[ABSTRACT  OBJECT].' 

USE  (defining  verb  in  32  definitions) 

EMPLOY  =  'to'  +  [ADV]  +  'use'  +  {ADV}  +  {NP  [OBJ]}  +  {PP} 

CONSUME  =  'to  use'  +  {'up'}  +  [NP  [OBJ]}  +  {PP/ADV} 

The  verb  USE  has  two  senses,  one  of  which  is  idiomatic.  The  base  sense  is  EMPLOY  and  the 
idiomatic  sense  is  CONSUME.  For  example,  the  word  'defend'  is  defined  as  'to  use  [EMPLOY] 
arguments  [OBJECT]  to  support,  protect,  or  show  the  rightness  of  [REASON].'  Or,  the  word 
'waste'  is  defined  as  'to  use  [EMPLOY]  wrongly  [MANNER].' 

The  definitions  for  the  idiomatic  sense  CONSUME  usually  contain  the  word  'up.'  For  example,  the 
word  'exhaust'  is  defined  as  'to  use  up  [CONSUME]  completely  [DEGREE].'  But  CONSUME  is  also 
used  for  the  word  'drink'  which  is  defined  as  'to  use  [CONSUME]  alcohol  [OBJECT].' 


Conclusion 

The  development  of  a  methodology  for  producing  both  syntactic  and  semantic  level  RRF  which  can 
produce  semantically  tagged  representations  of  definitions,  as  well  as  the  RRFs  themselves, 
will  aid  in  bringing  the  field  of  computational  linguistics  much  closer  to  its  goal  of  developing 
automatic  procedures  for  extracting  semantic  relations  from  text.  As  all  NLP  systems  are 
reliant  on  large,  rich  lexicons  of  the  common  vocabulary  of  everyday  communication,  this  effort 
provides  a  very  valuable  resource. 

The  work  will  be  continued,  hopefully,  if  time  and  resources  allow.  What  needs  to  be  done 
immediately  is  an  empirical  testing  of  the  RRF  on  the  other  half  of  LDOCE  and  then  on  other 
dictionaries  to  determine  the  generalizeability  of  the  RRF  herein  developed. 
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Exist  Exist  Exist  Exist  Cause  to  Be  Cause  to  Be  Cause  Cause 

(Characterist  (Condition/  (Role/Func  (State) 

ic)  Position)  tion) _ 

AdJ/Adv  +  {Verb}  'In/as/at/on'  +  NP  +  (PP)  Past  Part.  + 

+  {NP}  +  {PP}  NP  {PP/Adv} 


APPENDIX  1  -  Verb  Matrix 


Change  Change  Change  Cause  to  Transfer  Effect  (State)  Effect  (Act)  Provide 
Place  Position  Change 

Place 


TAKE 


Provide  Give  Put  Place  Possess  Cause  to  Experience  Perform  Present 

Possess 


(NP)  +  {PP}  {NP}  +  {PP/Adv} 


dicate  Push  Employ  Remove  Produce 


{NP}  +  'away/off 
/out/(rom/back‘ 
{NP}  +  (PP/Adv) 


IDIOMS 


Return 

o  + 

Sir- 

ill 

3.  2-:  >  a. 

Depart 

O. 

+ 

P  i 

1 

Grasp 

1 

'hold'  +  {NP} 

+  {ADV/PP} 

CD 

CC 

a 

'o 

CO 

0- 

0. 

2 

+ 

c  ^ 

—  Q. 

r  0. 

«  ^ 

P-  + 

o 

CO 

o 

o 

<0 

(0 

< 

‘a  particular 
meaning  on' 

Combine 

'together'  + 
(PP) 

Cover 

b 

+  1  ^ 

Q.  ° 

j.  ®  ^ 

^11 

elT 

Birth 

'birth'  + 

(Adv)  +  NP 

Forgo 

+ 

'O 

< 

az 

3 

Emit 

+ 

2 

+ 

3  ^ 

P  Sn. 

Represent 

sign, picture, 
symbol  {of}  + 
{NP)  +  {PP} 

BRING 

GIVE 

QD 

HAVE 

PLACE 

Q. 

TAKE 

USE 
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APPENDIX  2  -  Idiom  Matrix 


Consume  Inqest  Choose  Exceed 
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Abstract 

A  tunable  ring  laser  and  an  optical  amplifier  based  on  erbium  doped 
silica  fiber  as  a  gain  medium  were  developed  to  be  used  for  the 
wavelength  division  multiplexing  system  in  fiber  optic  communications. 

The  fiber  ring  laser  uses  a  GRIN  lens  and  a  Fabry-Perot  etalon  to  tune 
over  a  range  of  23  nm  wavelength  and  is  centered  around  1.5316  nm.  The 
laser  was  pumped  by  a  strained  quantum  well  InGaAs  laser,  which  delivered 
20  mw  of  power  in  the  fiber  with  a  consequential  output  power  of  10 
micro-watt.  The  ring  laser  was  based  on  a  4.75  meter  erbium  doped  fiber 
amplifier  developed  first  to  establish  an  experimental  model.  Peak  gain 
of  20  db  was  obtained  for  the  amplifier. 
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OPTICAL  COMMUNICATION  SYSTEM  BASED  ON 
TUNABLE  DOPED  FIBER  OSCILLATOR 


Salahuddin  Qazi 
and 

Kenneth  J.  Teegarden 


INTRODUCTION 

Optical  amplifiers,  based  on  erbium  as  dopant  in  a  single  mode 
silica  fiber,  are  increasingly  used  to  amplify  weak  signals  due  to 
attenuation  created  by  long  fiber  transmission  distances.  The  amplifier 
is  also  finding  applications  in  lightwave  networks  operating  in  a 
broadcast  mode  where  the  signal  from  the  central  station  is  split  into 
many  outputs  to  go  to  multiple  stations.  A  fiber  amplifier  can 
compensate  for  losses  as  the  signals  are  split  and  also  for  propagation 
losses  in  the  branches  of  network  over  extended  distances  of  10  to  25 
kilometers,  permitting  the  range  of  the  network  to  be  increased  (1). 

Gain  in  the  fiber  amplifier  is  based  on  a  certain  erbium  transition  at 
around  1.55  micron  which  has  attracted  great  interest  as  a  result  of  its 
enormous  potential  for  amplifiers  in  the  third  telecommunications  window. 

Erbium  doped  fibers  have  also  been  used  as  optical  oscillators  or 
lasers,  employing  the  same  gain  medium  as  amplifiers,  but  their 
applications  in  optical  communications  is  not  as  widely  investigated 
(2) .  For  increasing  the  amount  of  information  to  be  transmitted  in  a 
wideband  optical  communication  system,  wavelength  division  multiplexing 
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is  used.  In  wavelength  division  multiplexing  a  number  of  laser  diodes  of 
of  different  wavelength  are  independently  modulated  as  a  signal  source, 
and  all  the  outputs  are  launched  into  a  single  fiber  using  optical 
couplers  (3)*  A  tunable  optical  laser,  based  on  the  erbium  doped  silica 
fiber,  can  be  effectively  used  as  a  signal  laser  for  different 
wavelengths  in  a  wavelength  division  multiplexed  optical  fiber 
communication  systems.  Erbium  doped  fiber  lasers  based  on  ring 
configuration  offer  a  large  variety  of  features  such  as  large  tuning 
range,  high  output  power,  single  longitudinal  mode  operation  and  mode 
locking  at  repetition  rates  byond  10  GHz  (4).  Such  lasers  are  proposed 
for  use  in  wideband  fiber  optic  communications  where  they  would  replace 
solid  state  diode  lasers  as  the  primary  signal  source. 

METHODOLOGY 

In  order  to  implement  an  all-fiber  communication  system  at  1.5 
micron,  it  was  essential  to  develop  and  construct  a  fiber  amplifier  and 
an  oscillator  based  on  erbium  doped  fiber  as  a  gain  medium.  This  was 
achieved  in  the  following  manner: 

1.  A  fiber  amplifier,  based  on  erbium  doped  fiber,  as  shown  in 
figure  1  was  implemented.  It  consists  of  a  strained  quantum  well  InGaAs 
laser  diode,  operating  at  O.98  micron,  used  as  a  pump  laser  and  a 
wavelength  multiplexer  which  combines  an  input  signal  at  I.5  microns  with 
the  pump  signal.  A  wavelength  demultiplexer  at  the  output  of  the 
amplifier  is  used  to  separate  the  amplified  output  signal  from  any 
remaining  pump  signal.  Light  from  the  pump  laser  is  suitably  collimated 
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and  focused  with  microscope  objective  onto  the  fiber  end  at  the  input  of 
the  multiplexer.  Erbium  doped  fiber  has  a  core  diameter  of  8  micron  and 
an  outer  diameter  of  100  micron,  and  was  supplied  by  GTE  Laboratories  of 
Massachusetts.  Various  lengths  of  fiber  were  used  to  obtain  suitable 
gain  from  the  fiber  amplifier.  The  overall  gain  of  the  amplifier  as  a 
function  of  pump  power  and  signal  power  for  the  pump  wavelength  was 
measured.  Two  methods  were  used  to  make  these  measurements.  They  are 
shown  in  the  experimental  set-up  of  figure  2. 

In  the  first  method,  the  peak  output  power  was  measured  by  using  an 
optical  spectrum  anlayser  for  a  given  peak  input  power.  In  the  second 
method,  a  commercially  available  lightwave  transmitter  at  1.5  micron  was 
direct  intensity  (amplitude)  modulated  with  a  33  MHz  sinusoidal  RF 
signal.  The  resulting  modulated  signal  was  used  as  an  input  to  the  fiber 
amplifier,  and  the  output  signal  was  taken  at  the  output  of  the  lightwave 
receiver.  The  input  and  output  signals  were  measured  using  an 
oscilloscope.  In  both  the  cases  the  input  signal  was  measured  at  the 
output  end  of  the  demultiplexer,  hence  losses  in  this  device  were 
excluded  from  the  determination  of  overall  gain.  Measurement  of  pump 
power  launched  into  the  amplifier  were  made  with  a  Newport  power  meter  at 
the  output  end  of  the  demultiplexer.  The  emission  spectrum  of  the 
amplifier  as  a  function  of  pump  power  was  measured  using  an  optical 
spectrum  analyser. 
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2.  A  fiber  ring  laser  as  shown  in  figure  3  was  constructed  using 
erbium  doped  fiber  as  the  gain  medium  at  wavelengths  around  1.5  micron. 
The  ring  or  the  feedback  loop  is  completed  by  connecting  90%  of  the 
output  of  the  fiber  amplifier,  via  the  output  coupler  (splitter),  to  the 
input  of  the  wavelength  division  multiplexer  with  a  signle  mode  fiber.  A 
GRIN  lens  and  a  Fabry-Perot  etalon  consisting  of  about  1  mm  microscopic 
slide,  is  included  in  the  feedback  loop  as  a  tuning  and  wavelength 
selecting  element  for  the  ring  laser.  Further  tuning  was  achieved  by  a 
second  Fabry-Periot  etalon  consisting  of  0.3  mm  glass  plate. 

Emission  spectra  of  the  laser  output,  with  and  without  feedback,  was 
obtained  by  using  an  optical  spectrum  analyser.  Output  of  the  laser  with 
GRIN  lens  cavity  and  Fabry-Perot  etalon  was  obtained  by  using  an  optical 
spectrum  analyser.  This  was  repeated  without  the  Fabry-Perot  etalon. 

RESULTS  AND  DISCUSSION 

Amplifier  -  Emission  spectrum  of  the  erbium  doped  fiber  amplifier 
pumped  with  20  mw  is  shown  in  figure  4.  The  spectrum  showed  a  periodic 
ripple  superimposed  on  the  slowly  varying  envelope  of  the  f lourescence , 
whose  shape  and  amplitude  depends  on  the  amount  of  pump  power  and  the  way 
pump  power  was  launched.  It  was  also  noticed  that  at  a  certain 
connector  position  at  higher  power,  lasing  (oscillations)  were  induced 
as  shown  in  figure  5-  These  oscillations,  which  causes  instabilities  in 
the  gain  spectrum  of  the  amplifier,  were  minimized  by  careful  alignment 
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of  the  connectors  and  also  by  using  optical  isolators  in  the  amplifier 
design.  The  overall  gain  of  an  amplifier  with  an  erbium  doped  fiber  of 
length  4.75  meter  is  shown  in  figures  6a  and  6b.  This  indicates  that  the 
peak  of  the  gain  curve  for  the  amplifier  lies  at  about  1.5303  micron, 
while  the  signal  wavelength  did  not  coincide  with  the  wavelength  for 
maximum  gain.  The  gain  obtained  from  the  ratio  of  the  output  to  the 
input  signal  is  about  9-0  or  9-5  db.  The  overall  gain  of  an  amplifier 
using  4.75  meter,  measured  by  this  method  as  a  function  of  launched  power 
at  0.98  micron  pump  power  for  an  input  signal  power  of  30  nw,  is  shown  in 
figure  7*  These  results  indicate  that  higher  gain  would  be  obtained  by 
using  fiber  of  length  greater  than  4.75  meters.  Therefore,  length  of 
fiber  was  increased  to  7-75  meters,  and  measurements  of  overall  gain  were 
made  by  applying  a  I.5  micron  optical  signal  modulated  with  a  33  MHz  RF 
signal.  The  results  of  these  measurements,  as  illustrated  in  figure  8, 
show  higher  gain,  strong  dependence  on  launched  power,  and  considerable 
saturation  at  maximum  signal  power.  Finally,  the  gain  of  the  erbium 
doped  fiber  amplifier  was  estimated  by  correcting  the  maximum  overall 
gain  obtained  for  losses  incurred  in  the  bulkhead  connectors  and  the 
fusion  splices  between  the  silica  fiber  patch  cords  and  the  erbium  doped 
fiber.  The  peak  gain  cross  section  of  the  fiber  was  about  twice  that  of 
the  cross  section  at  the  signal  wavelength.  The  peak  gain  of  the 
amplifier  with  longer  fiber  was  found  to  be  approximately  20  db. 
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Ring  Laser  -  The  emission  spectrum  of  the  fiber  laser  without  the 
feedback  loop  at  20  mw  pump  power  in  figure  9  shows  maximum  output 
centered  at  1.533  micron.  Figure  10  shows  emission  and  lasing  spectrum 
on  the  same  laser  output  with  the  feedback  loop,  indicating  the  start  of 
lasing  around  1.555  micron.  The  output  of  the  fiber  laser  with  a  single 
mode  silica  fiber  in  the  feedback  loop  shows  certain  instabilities 
(figure  11)  and  mode  mixing,  which  became  relatively  stable  when  the  GRIN 
lens  and  Fabry-Perot  etalog  using  a  microscopic  slide  of  1  mm  was 
included.  The  output  in  figure  12  showed  a  lesser  number  of  modes 
centered  aroung  1.5316  micron.  Further  tuning  of  the  fiber  laser  output 
was  achieved  by  adjusting  GRIN  lens  and  microscopic  slide  of  0.3  mm 
thickness  as  shown  in  figure  13.  The  results  in  this  case  show  the 
presence  of  only  two  modes  of  lower  power  (about  160  nw)  separated  by 
about  23  nm  wavelength,  which  is  the  tuning  range.  Fiber  laser  output  as 
a  function  of  pump  power  with  and  without  etalon,  as  shown  in  figure  l4, 
indicates  a  linear  rise  in  the  output  power  after  the  threshold  current. 
The  output  of  the  fiber  laser  was  also  observed  to  contain  an  unmodulated 
signal  with  a  relatively  narrow  spectmm  of  several  modes  and  a  periodic 
signal  consisting  of  a  series  of  short  pulses  associated  with  the 
relatively  large  spectral  linewidth.  This  behavior  is  due  to  the 
spontaneous  mode  locking  occurring  in  the  output  of  this  erbium  doped 
fiber  ring  oscillator. 
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CONCLUSIONS 


The  project  shows  that  the  gain  medium  used  in  fiber  amplifier  can 
also  be  used  in  fiber  lasers,  which  are  proposed  to  replace  solid  state 
lasers  as  a  signal  source  in  wavelength  division  multiplexed  fiber  optic 
communications  systems.  Although  higher  gain  in  the  fiber  amplifier  is 
desirable,  such  a  gain  produced  instabilities  at  the  amplifier  output. 

It  was  also  noted  that  some  oscillations  were  produced  at  the  output  of 
the  fiber  laser  due  to  bulkhead  connectors  used  in  the  fiber  ring 
oscillator.  The  output,  however,  became  more  stable  when  the  connectors 
were  replaced  with  fusion  splices.  The  manually  adjustable  Fabry-Perot 
etalon,  and  tuning  elements  used  to  select  wavelength  and  tune  the 
laser,  can  be  improved  by  using  commercially  available 
precision-controlled  components.  The  relatively  large  spectral  line 
width  associated  with  the  spontaneous  mode  locking  can  have  a  limiting 
effect  on  the  tunability  of  the  fiber  oscillator  used  in  wavelength 
division  multiplexed  communication  systems.  It  is  important  to 
understand  the  mechanism  of  such  mode  locking  and  study  its  limitations 
on  the  tuning  of  fiber  lasers. 
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Figure  1  Erbium  doped  fiber  amplifier  configuration 
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Figure  2  Experimental  setup  of  erbium  doped  fiber  amplifier  system 
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Fabry  Perot  Etalon 


Figure  3  Tunable  erbium  doped  fiber  ring  laser 
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Figure  4  Emission  spectrum  of  erbium  doped  fiber  amplifier 
(wavelength  versus  output  power) 
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Figure  5  Optical  spectrum  of  the  fiber  amplifier  showing  lasing 
at  high  pump  power 
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Figure  6a  Optical  spectrum  of  the  input  signal  to  the  fiber  amplifier 


Figure  6b  Optical  spectrum  of  the  output  signal  to  the  fiber  amplifier 
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Figure  7  Amplifier  gain  versus  pump  power  for  4  meter  fiber 
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Figure  8  Amplifier  gain  versus  pump  power  for  7,75  meter  fiber 
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Figure  9  Emission  spectrum  of  the  fiber  ring  laser  without  the  loop 
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Figure  10  Optical  spectrum  of  the  fiber  ring  laser  showing  emission 
'  and  lasing 
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Figure  12  Optical  spectrum  of  the  fiber  ring  laser  with  the 
tuning  elements 
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Figure  13  Optical  spectrum  of  the  fiber  ring  laser  showing  two  tuned 
modes  seperated  by  23  nanometer 


8-23 


Laser  output  in  micro-watt 


Pump  power  (arbitrary  unit) 


Figure  14  Laser  output  as  a  function  of  pump  power 
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ABSTRACT 


This  final  report  completes  the  analysis  of  the  problem,  begun  at  Rome  Laboratories  at 
Hanscom  Air  Force  Base  during  the  Summer  of  1991,  concerning  extremely  short-time  pulse 
excitation  of  a  parabolic  reflector  antenna.  We  consider  a  pulse  that  is  short  compared  to  the 
time  required  for  it  to  traverse  the  depth  of  the  reflector,  so  that  only  a  fraction  of  the  reflector 
is  illuminated  at  any  given  time.  The  best  means  of  analysis  of  this  transient  problem  is  the 
Finite  Difference  Time  Domain  (FDTD)  method. 

The  computer  code  written  during  the  summer  assignment  has  been  modified  and  improved, 
and  can  now  simulate  a  realistic,  short,  high-frequency  modulated  Gaussian  amplitude  profile 
cylindrical  pulse.  Field  distributions  were  previously  simulated  for  a  deep  parabola  with  an 
unmodulated  Gaussian  source  in  both  transmit  and  receive  modes.  The  current  results  show 
that  for  a  source  with  a  high-frequency  modulating  carrier,  the  two-dimensional  parabola 
behaves  just  as  expected,  with  no  strange  time  artifacts  causing  complex  behavior.  Thus, 
the  often  perceived  assumptions  of  differential  transient  reflection  are  inaccurate.  A  short- 
time  pulse  radiates  from  a  parabola  aperture  in  the  same  fashion  as  if  it  were  radiating  from 
comparably  sized  hole  in  a  conducting  screen. 

One  additional  accomplishment  of  this  project  is  in  graphics  visualization  of  highly  oscillating 
waves.  Because  of  the  richness  of  available  field  data,  it  had  previously  been  impossible  to 
depict  the  modulated  waves  travelling  in  space  and  time.  Using  a  smoothing  scheme,  the  wave 
envelope  is  now  available.  This  low-pass  spatial  filtering  allows  more  easy  interpretation  of 
large  scale  wave  propagation — an  important  aspect  of  this  radiation  analysis. 
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FDTD  ANALYSIS  OF  THE  RADIATION  PROPERTIES 
OF  A  PARABOLIC  CYLINDER 
ILLUMINATED  BY  A  VERY  SHORT  PULSE 

Carey  M.  Rappaport 

INTRODUCTION 

As  signal  generation  technology  improves,  the  need  to  understand  the  behavior  of  shorter 
and  shorter  pulses  becomes  increasingly  important.  When  the  duration  of  the  pulse  becomes 
comparable  to— or  shorter  than— the  typical  length  scale  of  an  antenna  system,  new  ways  of 
studying  electromagnetic  field  propagation  become  necessary. 

One  important  problem  that  has  recently  been  studied  [1,2]  is  the  excitation  of  a  parabolic 
reflector  antenna  by  a  pulse  that  is  shorter  than  its  focal  length.  The  traditional  analysis 
method  of  tracing  rays  is  brought  into  question  since  the  rays  do  not  all  illuminate  the  reflector 
simultaneously. 

In  fact,  Hansen  first  states  [1]  that  the  leading  and  trailing  edges  of  a  short  pulse  have 
different  antenna  patterns,  leading  to  a  differential  transient  radiation  characteristic.  The 
leading  portion  of  the  transmitted  wave  is  first  reflected  by  the  reflector  surface  in  a  circle 
nearest  the  vertex  while  the  reflector  rim  is  unilluminated,  while  subsequently,  the  trailing 
pulse  edge  has  left  the  vertex  it  still  illuminates  an  annular  region  near  the  rim.  See  Figme 
1.  The  conclusion  is  that  the  initial  radiation  pattern  is  that  of  a  smaller  diameter  aperture, 
while  the  final  pattern  is  that  of  an  annulus  with  constant  outer  diameter. 

This  view  was  later  reversed  [2]  with  an  argument  that  rays  diffracted  from  the  edges  of  the 
reflector  leave  the  antenna  first,  before  reflected  rays  from  the  vertex.  The  transient  pattern 
should  instead  be  that  of  an  annular  aperture  first,  followed  by  the  gradually  shrinking  circle. 

The  object  of  the  current  study  is  to  show  that  neither  conclusion  is  accurate,  but  instead  that 
the  parabolic  reflector  has  no  characteristic  transient  behavior  other  than  a  simple  aperture. 
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The  Finite  Difference  Time  Domain  (FDTD)  method  is  often  used  to  simulate  the  time 
behavior  of  short  pulses.  It  is  not  based  on  extending  frequency  domain  analysis  across  a 
broad  frequency  spectrum,  but  instead  formulates  the  wave  propagation  completely  in  the 
time  domain.  The  shorter  the  time  pulse,  the  easier  it  is  to  model  the  field  interaction. 

Although  it  must  be  cautioned  that  using  numerical  methods  to  prove  theoretical  assertions 
is  dangerous,  it  is  the  transient  behavior  that  is  of  concern  here,  and  only  a  procedure  which 
emphasizes  the  different  temporal  aspects  of  the  field  can  be  effectively  employed. 

A  previous  report  [3]  analyzed  both  transmit  and  receive  pulses  that  had  unmodulated  Gaus¬ 
sian  spatial  amplitude  profiles,  and  a  modulated  Gaussian  plane  wave  receive  pulse,  all 
assumed  to  be  transverse  magnetic  (TM)  polarization.  The  results  for  the  2-dimensional 
parabolic  reflector  were  consistent  with  long  established  intuition,  that  the  parabola  converts 
a  circularly  cylindrical  wave  to  a  plane  wave  regardless  of  the  time  duration  of  the  incident 
pulse. 

This  report  describes  the  final,  most  realistic,  and  most  important  case:  a  high-frequency 
modulated  short-time  Gaussian  pulse,  transmitted  from  a  source  at  the  parabola  focal  point. 
The  problem  is  again  cast  as  2-dimensional,  with  TM  excitation.  We  expect  a  relatively 
uniform  plane  wave  reflected  from  the  parabola. 
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THE  FDTD  METHOD  OF  WAVE  ANALYSIS 


To  study  the  transient  wave  behavior,  a  time  domain  numerical  method:  Finite  Difference 
Time  Domain  (FDTD)  is  used.  This  method  was  first  proposed  on  a  cubical  lattice  by  Yee 
[4],  and  then,  more  recently,  popularized  by  others  [5-7].  Other  Finite  Difference  techniques, 
which  make  use  of  triangular,  conformal  meshes  in  both  the  time  and  frequency  domains  have 
been  presented  [8-10]. 

The  idea  behind  FDTD  is  that  Maxwell’s  partial  differential  equations  are  replaced  by  multi¬ 
dimensional  centered  difference  equations  in  space  and  time.  First,  space  is  discretized  into 
two  sets  of  interlocking  cubical  meshes,  with  the  cube  corners  of  one  mesh  coinciding  with  the 
cube  centers  of  the  other.  The  magnetic  flux  of  Faraday’s  Law  is  solved  at  the  center  of  every 
cube  face  in  terms  of  the  electric  field  on  the  edges  that  bound  that  face;  and  the  electric 
flux  of  Ampere’s  Law  is  solved  at  the  center  of  every  cube  face  of  the  complimentary  mesh 
in  terms  of  the  magnetic  fields  on  the  edges  that  bound  it.  With  isotropic  media  the  electric 
flux  and  the  electric  field  are  related  by  a  scalar  permittivity  constant,  e,  so  that  knowing  the 
first  from  Ampere’s  Law  allows  the  computation  of  the  magnetic  fiux  from  Faraday’s  Law. 
The  field  values  on  one  mesh  are  computed  at  one-half  a  time  step  ahead  of  those  of  the 
other,  so  that  the  difference  equation  in  time  at  one  centered  space  point  can  be  equated  to 
the  difference  equation  in  space  at  the  one  centered  time  point  of  the  complimentary  mesh. 

One  important  detail,  which  must  be  carefully  addressed  for  exterior,  unbormded  geometries, 
is  simulating  the  wave’s  radiation  to  infinity.  Although  the  computational  domain  terminates 
at  the  end  of  the  array  lattice,  the  wave  must  act  as  if  it  were  continuing  outward.  In  other 
words,  the  wave  must  not  reflect  off  the  artificial  boundary.  The  boundary  must  absorb 
all  incident  waves;  hence  it  is  referred  to  as  an  Absorbing  Boundary  Condition  (ABC).  In 
principle,  no  numerical  condition  can  absorb  waves  from  every  incident  angle,  but  there  have 
been  several  ABC’s  proposed  which  absorb  those  near  normal  incidence  [11-15]  which  makes 
use  of  pseudo-differential  annihilation  operators.  A  new  ABC  method  which  is  based  on 
simulated  anechoic  chamber  absorber  has  recently  been  presented  [15].  This  method  appears 
to  be  preferable  for  very  large  scatterers  and  higher  frequencies. 
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NUMERICAL  SIMULATION  ANALYSIS 

For  the  pai'abolic  reflection  problem,  a  two-dimensional  FDTD  method  was  implemented  on 
a  vector-processor  enhanced  Digital  Equipment  Corporation  VAX  9000  computer.  The  ABC 
used  is  the  Engquist-Majda  [12]  planar  condition  on  forward,  backward,  right  and  left  edges 
of  the  computational  domain.  The  parabolic  segment  is  chosen  to  be  four  times  as  wide  as 
deep  {FjD  =  0.25).  This  is  typical  of  deep  reflector  antennas,  with  focus  in  the  plane  of  the 

reflector  rim. 

The  Courant  Condition,  that  the  time  step  multiplied  by  phase  velocity  of  the  wave  must 
be  less  than  the  spatial  step,  is  satisfied  by  selecting:  r  =  cAt/Ax  =  0.5.  This  means  that 
a  wavefront  propagates  n/2  space  steps  in  n  time  steps.  The  computational  domain  used  is 
600  by  600  space  steps,  so  for  a  pulse  to  travel  from  the  first  row  to  the  last  row  and  then 
back  again  would  require  2400  time  steps,  amounting  to  a  total  of  864,000,000  computations 
each  of  and  Hy.  It  is  clear  that  supercomputers  are  particularly  useful  for  this  type 

of  problem.  Massively  parallel  platforms  are  well  suited  to  FDTD  calculation,  since  the  three 
field  components  are  calculated  for  the  entire  grid  at  each  time  step. 

The  circular  cylindrical  source  pulse  is  a  Gaussian  modulated  with  a  cosine,  by  given  at  t  =  0 
by: 

E  =  z - p= — cosk{p  —  po)  U/ 

JL 
y  po 

and  for  magnetic  field: 

H  =  (f> - -  -  cos  k(p  —  Po) 

'  Y  po 

The  parameters  chosen  for  the  simulation  are:  Ax  =  Ay  =  A,  W  =  20A,  po  =  lOOA,  y  =  377 
Ohms,  and  the  modulating  wave  number,  k  =  it /8  (corresponding  to  16  grid  points  per 

wavelengths). 

These  initial  wave  conditions  are  approximations  to  an  actual  cylindrical  wave  at  a  given 
time  sample.  They  are  the  asymptotic  (farfield)  cylindrical  pulse  wave  solutions  to  the  wave 
equation,  normalized  to  unity  at  p  =  po-  The  wave  representations  do  not  have  to  be  exact  for 
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a  cylindrical  wave  to  propagate.  Once  the  algorithm  starts,  the  laws  of  wave  propagation  are 
strictly  followed.  Any  imperfections  in  the  initial  conditions  will  lead  to  an  inward-travelling 
wave,  or  a  change  in  the  outward  cylindrical  wave  shape.  In  fact,  a  small  inward-travelling 
wave  with  about  5%  of  the  initial  amplitude  is  generated.  This  counter-propagation  artifact 
can  be  eliminated  using  an  absorbing  circular  disk  of  a  matched  lossy  medium  with  a  smaller 
radius  than  ~  po  “  3IT  [16]  coaxially  located  with  the  pulse. 

Figure  2  shows  a  three-dimensional  surface  plot  of  the  electric  field,  Ez,  given  in  Equation 
(1).  Only  the  field  data  sampled  every  12  points  in  x  and  y  are  shown.  Clearly,  this  highly 
modulated  cylindrical  pulse  is  not  well  depicted.  In  particular,  the  wave  appears  discontinu¬ 
ous,  and  the  maximum  value  in  the  plot  of  Figure  2  is  only  85  %  of  its  true  value.  The  next 
section  describes  an  improved  graphics  alternative  which  alleviates  this  deficiency. 

The  parabolic  reflector  is  given  by  the  equation: 


where  /  =  200 A  is  the  focal  length.  It  extends  across  2/3  of  the  width  of  the  computational 
domain  with  an  aperture  of  400  spatial  points.  The  focal  point  is  positioned  at  the  center  of 
the  grid  at  point  {i,j)  =  (300,300). 
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GRAPHICAL  PULSE  SMOOTHING 

In  the  first  attempts  to  analyze  the  cylindrical,  modulated  Gaussian  pulse,  the  sheer  volume  of 
data  made  it  quite  difficult  to  visualize  the  wave  behavior.  The  standard  method  of  graphical 
display,  three-dimensional  surface  plots,  while  effective  in  showing  unmodulated  pulses,  fails 
when  a  high-frequency  carrier  is  present.  The  problem  comes  from  the  trade-off  between 
displaying  all  the  data,  resulting  in  an  impossibly  dense  grid  with  600  by  600  crossing  lines; 
and  an  insufficient  undersampling  of  the  data,  which  misses  many  of  the  essential  details. 
It  was  decided  that  a  50  by  50  3-dimensional  surface  grid  is  the  maximum  density  for  easy 
visualization,  implying  that  only  every  twelfth  point  is  displayed. 

An  effective  means  of  identifying  the  large-scale  behavior  of  a  modulated  wave  is  to  detect 
its  envelope  by  low-pass  filtering.  This  is  the  same  well-established  technique  sometimes 
used  in  AM  radio  signal  demodulation.  In  the  current  application,  the  wave  signal  field  data 
is  available  for  spatial  points  rather  than  as  a  function  of  time,  so  the  smoothing  involves 
low-pass  spatial  filtering. 

Rather  than  using  a  formal  Fourier  transform  or  convolution  to  filter  the  spatial  data,  the 
envelope  can  be  simply  detected  by  finding  the  absolute  values  of  the  modulation  peaks,  and 
then  connecting  these  values  with  straight  line  segments.  As  long  as  the  modulation  is  fast 
enough  so  that  the  changes  between  peaks  are  small  relative  to  the  peak  values,  using  straight 
segments  to  join  the  peaks  does  not  introduce  much  error.  There  are  5  peaks  within  the  full 
Gaussian  pulse  width,  2W,  so  the  error  is  minimal.  The  line  segments  are  found  in  both  x- 
and  y-directions,  yielding  first-order  finite  element  patches  covering  the  absolute  value  of  the 
modulated  2-dimensional  spatial  waveform. 

Once  the  surface  patches  are  found,  the  intermediate  sample  points  are  easy  to  find  using 
bi-linear  interpolation.  The  result  of  smoothing  and  interpolating  the  data  of  Figure  2  is 
shown  in  Figure  3.  Note  that  the  wave  is  now  circular,  continuous,  and  almost  attains  its 
true  maximum  value:  97.9  %.  This  plot  does  not  depict  the  modulation  at  all,  but  shows 
the  overall  envelope  shape.  Used  in  conjunction  with  Figure  2,  the  smoothed  data  gives 
considerable  information  about  this  highly  modulated  extremely  numerically  dense  wave. 
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NUMERICAL  RESULTS 


Figures  4  through  15  show  the  progression  of  the  modulated  cylindrical  pulse  as  it  expands, 
interacts  with  the  parabolic  reflector,  and  then  forms  a  backward  propagating  plane  wave. 
Each  page  shows  two  plots  of  the  same  time  “snap-shot”  of  the  wave.  The  even  numbered 
flgures  are  the  undersampled  surface  plots,  while  the  odd  numbered  figures  show  the  smoothed 
envelope  of  the  wave.  The  time  samples  occur  every  150  time  steps.  Note  that  the  rate  of 
expansion  of  the  wave  between  Figures  2  and  4  (for  example)  is  about  an  eighth  of  the  total 
computational  grid,  or  about  75  space  steps,  as  expected.  It  is  also  seen  that  even  though 
the  envelope  plot  is  much  more  smooth  than  the  modulated  plot,  the  Gaussian  is  still  quite 
narrow  compared  to  the  entire  domain,  and  as  such  appears  to  have  a  ragged  peak. 

At  time  step  300At  the  pulse  has  encountered  the  central  portion  of  the  reflector.  Figure  6 
clearly  shows  that  an  interaction  is  occurring,  but  one  must  inspect  the  envelope  in  Figure 
7  to  see  that  a  plane  wave  is  being  reflected.  The  maximmn  values  at  the  edges  of  the 
reflected  plane  wave  are  due  to  the  standing  waves  formed  when  an  inverted  reflected  peak 
constructively  interferes  with  the  negative  portion  of  the  incident  wave.  In  Figure  8,  the 
reflected  plane  wave  is  clearly  formed,  with  uniformity  across  a  single  modulation  peak.  Also 
visible  are  the  interactions  of  the  incident  wave  with  the  domain  boimdaries.  Most  of  the 
wave  incident  on  the  boundaries  is  absorbed  with  the  Engquist-Majda  ABC,  seeming  to  just 
vanish.  Figures  10  and  11  however,  show  that  there  are  sHght  reflection  from  these  edges.  At 
750At,  shown  in  Figures  12  and  13,  the  incident  cylindrical  wave  is  no  longer  apparent.  All 
that  remains  is  the  parabohcally  reflected  uniform  plane  wave  and  the  ABC  artifacts. 

This  sequence  of  plots  clearly  indicates  that  the  parabola  generates  an  almost  uniform,  trans¬ 
mitted  plane  wave  with  envelope  width  comparable  to  that  of  the  incident  source  radial  ptflse 
width,  much  as  would  be  expected. 
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CONCLUSIONS 


An  analysis  of  the  reflection  characteristics  of  parabolic  reflectors  to  short  pulses  has  been 
completed.  Contrary  to  published  theory,  the  parabola  does  not  produce  any  unexpected 
transient  effects  beyond  the  standard  dispersive  effects  of  free-space  propagation  of  a  short 
pulse. 

The  FDTD  method  was  used  to  analyze  the  transient  behavior  of  the  two-dimensional 
parabolic  reflector  illuminated  with  a  short  TM  pulse.  In  the  numerical  case  considered 
in  this  report,  as  well  as  in  the  three  previous  cases,  the  wave  behaved  as  expected,  focusing 
to  high  intensity  at  the  parabola  focal  point,  or  forming  a  planar  transmitted  phase  front. 
The  reflector  seems  to  have  no  differential  effects  on  the  various  time  samples  of  the  pulse 
the  reflection  of  the  initial,  leading  edge  of  the  pulse  joins  correctly  with  the  subsequently 
reflected  trailing  portions  of  the  pulse. 

There  will  be  the  same  transient  behavior  of  a  wave  pulse  for  parabolic  reflection  as  for 
propagation  in  free  space,  but  the  parabola  itself  does  not  cause  any  transient  non-uniformities 

in  phase. 

The  enhancement  of  envelope  detection  by  low-pass  spatial  filtering  improves  the  clarity 
of  graphic  visualization.  Using  this  option,  the  bulk  behavior  of  the  wave  is  more  clearly 
visible.  The  high  resolution,  detailed  images  are  still  required,  however,  to  give  the  more 
precise  information  required  for  careful  analysis  of  individual  wave  peak  interactions  with  the 
reflector. 

There  may  be  some  concerns  about  the  choice  of  a  two-dimensional  reflector  illuminated  with 
transverse  magnetic  field.  It  is  much  more  complicated  to  develop  the  three-dimensional 
FDTD  algorithms  needed  to  analyze  a  circular  paraboloid  of  revolution.  Furthermore,  the 
transverse  electric  (TE)  polarization  is  hard  to  model  accurately,  since  the  electric  field  be¬ 
comes  infinite  at  sharp  edges  (like  the  edge  of  the  reflector).  These  are  important  limitations 
of  the  current  study.  However,  the  claim  that  there  is  a  differential  transient  effect  due  to 
the  reflector  [1,2]  also  does  not  address  wave  polarization,  but  relies  solely  on  a  path  length 
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analysis.  For  this  transient  claim  to  be  valid,  it  must  apply  to  a  2  dimensional  parabolic 
reflector  and  to  all  polarizations.  Our  FDTD  analysis  provides  a  clear  counter-example  to 

the  claim. 
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Short  Time  Pulse 


t  <  t_,  t  >  t, 

Illuminated  Region  of 
Paraboloidal  Reflector 


Figure  1:  A  short  pulse  in  time,  and  the  different  illuminated  regions  of  a 
paraboloidal  reflector  surface. 
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Figure  3:  Low-pass  spatially  filtered  and  interpolated  data  of  Figure  2,  in¬ 
dicating  the  Gaussian  envelope  of  the  cylindrical  wave. 


mml 


Figure  7:  Smoothed 
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Figure  12:  Sampled  modulated  wave  at  t  =  750At. 


Figure  13:  Smoothed  version  of  the  wave  at  <  =  750At 
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Abstract 


The  most  popular  method  for  training  neural  networks  is  the  error  back- 
propagation  algorithm.  In  principle  this  method  of  training  ought  not  to  be  applicable 
to  many  widely  used  networks  because  the  functions  which  describe  them  are  not 
continuously  differentiable  (C^).  In  practice  this  difficulty  can  be  overcome  by 
constructing  a  second  neural  network  which  approximates  the  behavior  of  the  first 
network  but  having  the  property  that  the  functions  which  describe  the  behavior  of  the 
second  network  are  functions.  One  can  train  the  second  network  and  transfer  the 
results  of  the  training  to  the  first  network.  Systematic  methods  of  constructing  such 
approximating  networks  were  studied.  In  the  case  where  the  original  network  is 
describable  in  terms  of  continuous  functions,  particular  attention  was  paid  to  a  method 
of  constructing  approximations  by  the  use  of  convolutions.  In  the  case  where  the 
original  network  is  describable  by  discontinuous  functions,  it  was  found  that  there  are 
fundamental  mathematical  obstacles  to  constructing  sastisfactory  approximations,  and 
some  progress  was  made  in  overcoming  these  obstacles. 
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APPROXIMATION  TECHNIQUES  FOR  INVESTIGATING 
AND  TRAINING  NEURAL  NETWORKS 


Michael  D.  Taylor 


INTRODUCTION 

Neural  nets  have  emerged  as  a  possible  alternative  or  complement  to  standard 
computer  technology.  They  are  believed  to  have  a  potential  for  performing  such  tasks 
as  image  processing,  pattern  recognition,  pattern  reconstruction,  and  optimal  path 
determination  in  a  swifter  and  more  satisfactory  fashion  than  standard  computers.  Two 
well-known  introductions  to  the  area  are  [6]  and  [12],  and  a  good  survey  of  recent 
advances  can  be  found  in  [9]  and  [10] . 

One  very  important  characteristic  of  neural  nets  and  one  reason  for  hoping  they 
can  perform  some  otherwise  difficult  tasks  is  that  neural  nets  have  the  potential  to  be 
trained  rather  than  having  to  be  programmed.  The  most  widely  used  method  of 
training  neural  nets  is  the  error  back-propagation  algorithm.  The  back-propagation 
algorithm  has  been  described  many  times  in  the  literature  on  neural  nets.  (See,  for 
example,  [13]  or  [15].) 

The  description  of  the  back-propagation  algorithm  has  usually  been  given  with 
certain  restrictions  in  mind,  namely  that  computations  of  cell  states  will  involve 
multiplying  connection  weights  by  other  cell  states,  adding  a  number  of  results  of  this 
kind  together,  then  adding  (or  subtracting)  a  “threshold”  value,  and  finally  applying  a 
so-called  “squashing  function”  or  “transfer  function”  to  the  result.  That  is,  the  value  of 
the  zth  cell  in  the  network  is  given  by  a  function  of  the  form 

j 

where  the  sum  is  taken  over  all  j  for  which  the  jth  cell  has  an  input  to  the  ith  cell. 
This  means  that  a  very  particular  architecture  is  being  assumed.  However  it  is 
straightforwaxd  to  extend  back-propagation  to  layered,  feedforward  networks  of  a  more 
general  sort  than  those  which  are  usually  considered. 
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There  is  also  a  certain  technical  restriction  on  the  method  of  back-propagation: 
The  algorithm  requires  that  the  ways  in  which  units  of  a  neural  net  depend  on  one 
another  all  be  describable  in  terms  of  what  are  called  continuously  differentiable 
functions  (also  known  as  functions).  Not  all  nets  meet  this  requirement,  so  that  any 
attempt  to  apply  back-propagation  to  them  lacks  theoretical  justification. 

It  should  be  noted  in  connection  with  these  last  two  points  that  there  is 
recognition  in  the  neural  network  community  of  the  desirability  of  studying  neural 
networks  with  nonstandard  architectures.  This  point  is  specifically  brought  out  in  the 
DARPA  study  on  neural  networks,  [3].  Another  example  occurs  in  [11]  where 
nonstandard  architectures  of  interest  in  image  processing  are  mentioned  as  worthy  of 
further  investigation;  this  last  example  makes  use  of  operations  typical  of  fuzzy  analysis. 

A  particular  motivation  for  this  work  is  interest  in  neural  nets  having  an 
architecture  suggested  by  concepts  occuring  in  fuzzy  analysis.  Fuzzy  analysis  is  a 
method  of  dealing  with  uncertainty,  particularly  uncertainty  arising  from  vagueness 
rather  than  randomness.  (Example:  When  is  a  pear  ripe?  There  is  more  than  one 
possible  answer.)  There  has  recently  been  a  great  deal  of  interest  in  using  fuzzy  analysis 
in  control  and  classification  problems;  for  example,  as  described  in  [7],  [1],  and  [5],  an 
automatic  train  operating  system  has  been  successfully  tested  in  Japan,  fuzzy  control 
logic  has  been  applied  to  a  glass  melting  furnace,  and  fuzzy  set  theory  has  been  used  for 
classifying  pyrometallurgical  industry  products.  Fuzzy  analysis  has  also  been  turned  to 
the  problem  of  pattern  recognition  (see  [2])  with  possible  applications  to  military 
intelligence  and  medical  data. 

Very  little  seems  known  about  neural  net  architectures  which  derive  from  fuzzy 
analysis  concepts.  (But  an  example  of  such  an  architecture  can  be  seen  in  [8].)  One 
difficulty  in  investigating  such  architectures  is  that  the  functions  one  typically  uses  in 
fuzzy  analysis  (maximum  and  minimum)  are  nondifferentiable.  This  means  the 
application  in  this  area  of  the  back-propagation  algorithm,  our  most  widely  used  tool  in 
training  a  neural  net,  lacks  firm  theoretical  justification. 


THE  PROBLEM 

Working  in  the  1991  AFOSR  Summer  Research  Program,  [14],  this  author  began 
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to  develop  a  way  —  at  least  in  theory  —  to  extend  the  back-propagation  algorithm  to 
many  networks  for  which  its  application  would  not  normally  be  justified.  The  idea  is 
essentially  this:  Suppose  we  are  given  a  neural  net  whose  workings  are  described  by 
continuous  functions.  We  construct  a  second  neural  net  whose  behavior  approximates 
that  of  the  first  net  very  closely  —  as  closely  as  we  may  wish  —  but  whose  workings  are 
described  by  (that  is,  continuously  differentiable)  functions.  Let  us  call  this  second 
network  a  network.  (The  fact  that  the  approximating  network  can  be 
constructed  is  a  straightforward  application  of  some  well-known  facts  from 
mathematical  analysis.  A  proof  of  the  construction  and  a  particular  way  to  carry  it  out 
are  given  in  [14].)  The  back-propagation  algorithm  can  now  be  applied  to  train  the  C* 
network.  The  results  of  this  training  —  connection  weights  and  threshold  values  of 
cells  —  can  then  be  transferred  to  the  original  network.  Since  the  two  networks  behave 
in  very  similar  ways,  the  original  network  then  behaves  as  though  trained. 

Two  remarks  are  now  appropriate. 

First,  in  the  work  just  described  these  approximation  techniques  have  been 
studied  only  in  the  case  when  the  neural  network  is  describable  by  continuous  functions. 
The  case  of  discontinuous  functions  was  not  examined.  Yet  the  idea  of  approximating  a 
neural  network  by  a  network  and  training  the  network  by  back-propagation  is 
implicit  in  the  literature  from  early  on,  and  it  is  precisely  this  case  in  which  the 
functions  are  discontinuous  which  is  dealt  with.  (See  for  example  [13]  in  which  the 
nodes  of  the  networks  take  on  only  the  values  -fl  and  —  1.)  Moreover  this  discontinous 
function  case  is  one  in  which,  as  we  shall  show  later,  serious  difficulties  might  emerge. 

Second,  we  may  now  formulate  the  main  goals  of  this  work.  On  the  one  hand  we 
wish  to  find  a  way,  given  a  neural  network,  to  construct  an  approximating  network 
and  to  be  able  to  state  with  confidence  what  sort  of  approximation  is  meant  and  how 
good  it  is.  On  the  other  hand  we  wish  to  develop  a  means  of  investigating  nonstandard 
neural  net  architectures,  particularly  those  suggested  by  concepts  from  fuzzy  analysis. 
Achievement  of  the  first  goal  can  be  used  as  a  means  of  attaining  the  second  goal. 
Much  progress  has  been  made  toward  the  first  goal  in  the  continuous  case.  Only  partial 
progress  has  been  made  in  implementing  this  work  in  computer  simulations,  in 
investigating  the  discontinuous  case,  and  in  the  investigation  of  nonstandard 
architectures. 
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THE  CONTINUOUS  CASE 

Suppose  we  are  dealing  with  a  feed-forward,  multilayer,  synchronous  neural 

network.  Suppose  the  input  layer  has  Nq  cells,  the  next  layer  has  the  next  N2,  and 

so  on  to  the  output  layer  which  has  Nj.  cells.  We  assume  the  state  of  each  cell  is  given 

by  a  real  number  (or  by  an  element  of  some  subset  of  the  real  numbers  such  as  {  —  1,1}) 

N- 

so  that  the  state  of  the  ith  layer  at  time  t  may  be  considered  to  be  a  point  in  [R  b 
There  must  be  a  function  Ff  R  *  —  1  IR  *  with  the  property  that  if  x  is  the  state  of 
the  (i  —  l)th  layer  at  time  t,  then  Fj{x)  is  the  state  of  the  ith  layer  at  time  t  +  1.  It 
follows  that  the  neural  net  architecture  is,  in  effect,  described  by  a  sequence  of  functions 
thus: 


Notice  that  what  one  might  call  the  total  response  or  ultimate  response  of  the  network, 
namely  the  output  arising  from  a  given  input,  is  given  by  the  composite  function 
Fr.oF^_^o...oF^. 


What  we  call  the  continuous  case  is  the  one  in  which  all  the  F^'s  are  continuous 
functions.  This  takes  in  functions  such  as  max  and  min  which  occur  in  fuzzy  analysis 
(hence  by  implication  some  interesting  nonstandard  architectures)  but  not  such 
functions  as 


sgn(x) 


—  1  if  X  <  0 
1  if  X  >  0 


which  occurs  quite  frequently  in  the  literature. 

The  back-propagation  algorithm  can  be  applied  provided  each  F^  is  a 
function.  In  practice  this  requirement  is  often  not  satisfied.  The  usual  response  in  the 
literature  to  such  a  situation  seems  to  be  the  replacement  of  each  F^  by  a  function 
G-  which  behaves,  in  some  sense,  approximately  the  same  as  F^.  For  example,  sgn(x) 
might  be  approximated  by 


ax _ 1 

9a{x)  =  lax  I  where  a  >  0. 

The  replacement  is  apparently  done  on  an  ^  hoc  basis;  experimentation  shows  which 
replacements  work  and  and  which  do  not.  In  order  for  this  procedure  to  be  a  reasonable 
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one,  it  is  essential  that  o  ^  o  •  •  -  o  be  closely  approximated  by 
Gj.oG^_-^o---oG-^.  As  we  shall  show  in  the  next  section,  situations  can  easily  arise  in 
which  this  does  not  happen. 

In  [14]  a  particular  class  of  approximations  is  explored  in  a  systematic  way  for 
what  is  essentially  the  continuous  case.  We  describe  this  procedure  in  a  somewhat 
simplified  form:  For  each  F^-  construct  a  function  (f)-:R  *  “  IR  satisfying 
suppi^i  £  ■ 

^i>0, 

and 

/i  =  1, 

where  supp  is  the  support  of  and  6^  _  j  is  a  given  positive  number.  Then  set  G^  = 
where  *  is  the  operation  of  convolution.  (Since  F-  is  vector- valued  and  (j)-  is  real¬ 
valued,  F^*<l)-  is  the  vector-valued  function  obtained  by  convoluting  every  component  of 
Fj  with  <f>j^  .)  In  [14]  it  was  shown  how,  by  properly  choosing  the  S-'s,  one  may 
construct  G^-’s  so  that  o  G,.  _  o  •  •  •  o  differs  in  norm  from  F^  o  F,.  _  o  •  •  •  o  F]^  by 
less  than  any  previously  assigned  e.  (The  norm  used  in  [14]  was  the  sup  norm.  That  is, 
if  a:  =  (xj^,X2,*--,a:„),  a  point  in  IR”,  then  the  sup  norm  of  x  is  |lx||  = 

sup{  ]  Xj- 1 : 1  <  i  <  n}.  In  effect  this  measures  the  difference  in  performance  between  two 
networks  by  comparing  the  values  on  corresponding  cells,  a  pair  at  a  time,  and  taking 
the  norm  to  be  the  largest  magnitude  of  the  difference  of  values  of  corresponding  cells. 
This  might  also  be  thought  of  as  an  norm.) 

In  a  somewhat  expanded  version  of  this  work  which  has  been  submitted  to  the 
Air  Force  as  a  technical  report,  [4],  it  is  brought  out  that  there  is  a  difficulty  in  running 
computer  simulations  of  these  G^  approximation  techniques.  Convolutions  tend  to  be 
procedures  which  require  a  great  deal  of  computation.  Since  training  a  neural  network 
by  back-propagation  can  require  a  huge  number  of  training  runs,  to  combine  this 
procedure  with  convolutions  is  to  create  a  process  which  takes  fax  too  long  to  run  on  a 
computer.  However  there  are  several  things  which  can  be  done  about  this. 

First  of  all  one  can  restrict  one’s  attention  to  the  special  but  still  interesting  case 
in  which  the  state  of  the  ith  cell  is  given  by 

j 

where  the  sum  is  taken  over  all  j  for  which  the  jth  cell  has  an  input  to  the  ith  cell. 
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Assuming  that  /  is  a  continuous,  nondifferentiable  function,  one  can  then  form  a 
convolution  not  with  /  ^  function  of  many  variables,  but  with  /,  a 

function  of  a  single  variable.  Not  only  does  this  greatly  speed  up  the  process  of  running 
a  computer  simulation,  it  turns  out  that  the  theorem  in  [14]  which  estimates  the 
“goodness”  of  approximation  of  the  approximating  network  still  holds  with  the  same 
epsilons  and  deltas.  This  strategy  has  been  successfully  implemented  in  [4]  for  the  XOR 
problem  using  for  /  the  function 

0  if  x  <  0 

/j.(x)  =  <  X  ifO<x<l 

1  if  1  <  X. 

Second  it  should  be  noted  that  this  difficulty  we  are  considering  does  not  say 
anything  is  wrong  with  the  theory  of  using  convolutions  to  construct  approximating 
networks,  only  that  it  is  hard  to  simulate  the  application  of  such  a  theory  on  a  serial 
computer.  One  might  try  to  find  particulary  fast  ways  to  compute  convolutions. 
Alternatively,  when  dealing  with  a  specific  neural  net  architecture  and  set  of  functions 
defining  cell  states,  one  might  hope  to  construct  unusually  efficient  convolution 
techniques  for  that  particular  case.  Finally  one  might  consider  constructing  special 
hardware  to  implement  the  convolution/ approximation  concept;  there  exist  in 
engineering  literature  well-developed  digital  and  analog  techniques  for  computing 
convolutions  with  great  rapidity. 

Third,  the  use  of  convolutions  is  only  one  possible  technique  for  constructing 
approximating  networks.  Its  great  virtue  is  its  generality  and  the  ease  of  theoretical 
analysis.  There  are  other  means  of  constructing  approximations  to  given  functions. 
Here  are  some  examples  of  approximations  of  functions  of  interest  in  the  study  of  neural 
networks: 

Recall  that 

0  if  X  <  0, 

/j.(x)  =  <  X  if  0  <  X  <  1, 

1  if  X  >  1. 


H{x) 


0  if  X  <  0, 
1  if  X  >  0. 
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sgii(a:)  = 


—  1  if  x  <  0, 
1  if  a;  >  0. 


fj'  is  the  threshold  function,  H  is  the  Heaviside  function,  and  sgn  is  the  signum 
function. 

It  is  widely  recognized  that 

H{x)  ~  ^  where  a  >  0. 

In  this  case  the  approximation  is  not  to  be  understood  as  sup  norm  or  uniform,  since  H 
has  a  discontinuity  at  0,  but  in  some  other  sense,  perhaps,  for  example,  as  an  i,^ 
approximation  or  in  terms  of  convergence  in  measure  or  pointwise.  This  particular 
approximation  is  widely  used  in  the  study  of  neural  nets. 

Since  sgn(x)  =  2H(x)  —  1,  we  have 

0  1 
^  ^  g  -  aa:  -  1  where  a  >  0. 

Again  the  approximation  should  be  probably  be  understood  in  an  sense  or  in  terms 
of  convergence  in  measure  or  as  pointwise  convergence. 

Both  min  and  max  (or  A  and  V )  axe  of  interest  in  fuzzy  analysis  and  in  some 
areas  of  application  such  as  image  processing.  Note  that 


These  should  be  understood  as  pointwise  or  approximations  (with  respect  to  some 
given,  bounded  interval).  Since 

xAy  =  (1/2)  (x  +  y-  I  x-y  I ) 
xVy  =  (1/2)  (x  +  y  +  |x-y  I)  , 
this  leads  to  the  following  approximations  for  A  and  V  : 

"A!'  “  + 
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xW  y 


1  {-+!'+(- 


These  approximations  are  to  be  understood  in  a  pointwise  sense  or  i.°°  sense  with 
respect  to  some  bounded  interval. 

Finally  we  note  that 


aiiy  w t;  luclu 

fri^)  ~  ^  - x)  where  a  >  0. 

^  "  Vl  +  e“(^-W 


Again  this  is  a  pointwise  or  i.°°  approximation.  As  a  matter  of  fact,  if  we  set 


s„(x)  - 


then  it  can  be  shown  that 


I  -  SokA  I  =  S  Ki  I  e  <  i  • 


THE  DISCONTINUOUS  CASE 

If  we  are  given  a  neural  network  describable  by  a  sequence  of  functions  thus, 


r  oA'r 


where  the  Fj-’s  may  now  be  discontinuous,  then  we  wish  to  construct  an 
“approximating”  network 

iVn  Ni  ^2  N 

R  0  J  R  1  •••  -♦  R^r 

where  the  Gj’s  are  functions.  Presumably  each  will  be  approximated  by  in 
some  appropriate  sense.  Also,  it  is  absolutely  essential  that  Fj.  o  F^.  _  o  •  •  •  o  be 
approximated  by  o  1  o  ••  •  o  Gj.  What  is  a  reasonable  sort  of  approximation  to 
use?  Since  we  are  dealing  with  discontinuous  functions,  we  cannot  talk  about  uniform 
or  approximations.  We  present  some  examples  which  shed  a  little  light  on  the 
situation: 


Example  1.  Let 


0  if  a;  <  0, 
1  if  X  >  0, 
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and 


9a{^)  =  2  e  -  ax  where  a  >  0. 

It  is  readily  seen  that  as  a  -+  oo,  we  must  have  -*  H  in  i.^(A)  and  in  measure  (where 
the  measure  in  question,  A,  is  Lebesgue  measure  on  iR). 

Now  let  us  consider  what  happens  when  we  compose  these  maps.  Note  that 
H  o  H  —  H.  But  notice  that  if  a:  ^  0,  then  ga{x)  «  0  and  hence  [g^^  o  go^{x)  1/2. 
Thus  g^i  o  g^^  does  not  converge  io  H  o  H  \n  any  of  the  senses  considered  above. 

This  example  reveals  something  very  important.  Namely,  even  if  one  chooses 
very  reasonable  -looking  approximations  and  ,  one  cannot  be  sure 

that  ^2  o  w  ^2  o  G^.  This  seems  to  be  the  central  problem  in  constructing  neural 
net  approximations  in  the  discontinuous  case. 


Example  2.  Now  let 


sgn(x) 

and 


—  1  if  X  <  0, 
1  if  X  >  0, 


fai^) 


e“*-l 
e“^  +  l 


where  a  >  0. 


As  in  the  last  example,  as  a  oo,  we  see  that  sgn  in  i.^(A),  pointwise,  and  in 
measure. 

What  happens  with  the  compositions?  We  have  sgn  o  sgn  =  sgn.  As  regards 
fa  °  fat  first  that  it  must  be  an  increasing  function  with  range  between  0  and  1,  so 
there  is  some  hope  it  may  converge  to  sgn.  However  —1  <  /q;(  —  1)  <  {fa°fa){^) 
<  /a(l)  <  1)  which  implies  the  axea  lying  between  the  graphs  of  fa°fa  ^rid  sgn 
must  be  infinite  and  thus  there  can  be  no  convergence  as  a  -+  oo.  Now  suppose  x  is 
fixed  and  positive.  Notice  that 


(fa  °  fa){x) 


g“/ a(®)  _ 
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If  q:  ->  oo,  then  we  must  have 
fa{x)  -+  1, 
afaix)  oo, 

and  hence 

ifa°fa)i^)  1- 

Similarly  for  x  fixed  and  negative,  we  must  have  (/^  o  fa){x)  —  1  as  o:  oo.  It 
follows  from  this  and  the  geometry  of  the  graphs  that  fa°  fa  °  measure. 

This  example  seems  to  say  that  i,^  convergence  is  not  an  appropriate  choice. 
And  if  not  i,\  then  presumably  neither  is  JL^  for  some  other  choice  of  p.  Both 
pointwise  convergence  and  convergence  in  measure,  both  which  do  work  here,  have  nice, 
intuitive  meanings.  In  the  case  of  convergence  in  measure,  it  means  that  the 
approximating  function  has  almost  the  same  value  as  another  one  most  of  the  time. 


Example  3.  We  now  return  to  the  H  of  Example  1  and  produce  a  modification 
of  Pa  such  that  its  compositions  will  converge  pointwise  and  in  measure  to  compositions 
of  H.  As  a  matter  of  fact  we  shall  show  convergence  holds  for  fc-fold  compositions.  (By 
the  A:-fold  composition  of  F  with  itself,  we  mean  F  o  F  o  •  ■  -  o  F  where  F  occurs  k 
times.)  Convergence  in  i,^  fails  for  the  same  reason  as  given  in  Example  2. 

We  construct  the  function 

Cryfa;)  =  - ;r -  where  a  >  0. 

Note  that  ej^  must  be  an  increasing  function  whose  values  lie  strictly  between  0  and  1. 
Since  =  H,  we  need  only  show  that  e^^*^  H  in  measure.  Because  of  the  geometry 
of  the  graphs  of  the  functions,  it  is  sufficient  to  show  that  eQ^(O)  0  and  ->•  1  for 

X  fixed  and  positive  as  a  oo. 

We  show  by  induction  that  for  every  natural  number  k  we  have  ->■  0  as 

a:  oo.  (This  gives  us  ea^(O)  -►  0.)  Since 

9  2 

a"^ec^(0)  =  “^0  as  a  -♦  oo. 
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we  have  the  proposition  for  k  =  1.  Suppose  we  know  oP‘eJ^{0)  0  as  a  -♦  oo.  From  the 


definition  of  e^,  we  obtain 


2^  k  +  l(n\  _ 


a^e/  +  ^(0) 


which  by  the  induction  hypothesis  must  converge  to  0  as  or  oo. 

Now  let  X  be  fixed  and  positive.  We  induct  on  k  to  show  ej‘{x)  1.  This 
follows  easily  for  /;  =  1,  so  now  suppose  we  know  it  for  some  given  k.  Note  that 
a  —  a‘^ej^{x)  -*  —  oo  as  a  oo.  Since 


e  ^  +  =  _ I _ 

\  ^ct  (®) 

it  follows  that  e^^^  ^(a:)  -+  1  as  o;  oo.  This  establishes  convergence  pointwise  and  in 


measure. 


What  is  revealed  in  this  example  is  that  even  though  “reasonable”  choices  of 
approximating  functions  may  turn  out  to  give  bad  results,  there  may  be  other  choices 
which  give  good  results.  An  obvious  question  then  is,  how  can  one  know  beforehand 
that  a  choice  is  a  good  one?  Better  yet,  how  can  one  construct  good  choices? 

We  present  here,  in  partial  answer  to  this  question,  a  result  concerning 
compositions  and  convergence  of  approximations  to  discontinuous  functions. 

A  reasonably  general  class  of  functions  to  consider  is  the  set  of  /:IR->1R  which  are 
of  bounded  variation.  Every  such  function  can  be  decomposed  into  a  sum  f  =  g  +  h 
where  g  is  continuous  and  h  is  a.  step  function  with  at  most  a  countable  number  of 
discontinuities.  We  already  have  some  idea  of  how  to  deal  with  continuous  functions 
from  [14].  Recalling  the  defintion  of  the  Heaviside  function  H,  note  that  h  can  be 
written  in  the  form 

h{x)  =  ^  a^H{x  -  X  •)  +  b 
i 

where  b  and  the  a^’s  and  x^’s  are  constants.  We  will  look  at  functions  of  this  last  form 
and  possessing  only  a  finite  number  of  steps. 

Recall  that 
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Gfvfa;)  =  - rr -  where  a  >  0, 

^  _ _ /-i  i„.\\ 


or  equivalently 
ea(a;) 


^j(^)  =  {H  -  X--))  +  bj  for  i  =  1,  2,  •  •  •, 

I  =  1 


^j,  a(^)  =  (E  ^ij  “  ^ij))  +  for  i  =  1,  2,  •  •  •, 


where  or  >  0,  each  a-j  ,  x-j  ,  and  bj  is  a  constant,  and  n>2.  Let 


and 

^k,Q  ~  ^k,a°^k-l,a°"'^^l,a' 

Then  the  following  result  shows  that  the  Fj  ^’s  are  the  “right”  kind  of  functions 
with  which  to  approximate  the  Fj's: 

THEOREM.  hm^  for  all  X. 

NOTE:  It  can  be  seen  from  the  proof  that  the  convergence  is  faster  than  that  of 
a^/e“  as  a  -+  oo. 

Proof.  First  of  all  we  will  prove  by  induction  on  k  that 


Ur^  aqe„(Gjj.  „(i)  +  c)  -  /r(Gj(i)  +  c))  =  0 


where  c  is  an  arbitrary  constant. 
If  a;  <  0,  then 


q;2  (ea(a:)  -  H{xfj  =  e„(a;) 
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- — - 0  as  a  oo. 

l+e“-“  * 


If  X  >  0,  then 


a2(eQ(x)-F(x))  =  a2(ea(a:)-l) 


-+  0  as  a;  -♦  cx>. 


Hence 


(ea(3:)  -  =  0. 


From  this  we  see  that 

Sj^oo  (^1,  ~  ^  -H{x-  X -i))  =  0. 

1  =  1 

Next  notice  that  we  can  write 

q;-Q!2(Gi^^(x)  +  c)  =  Q:-a2(Gi(x)  +  c)  +  Q'2(Gi(x)-Gi^^(x))  , 
so  that  if  we  let  a  -+  oo,  we  obtain 

a  —  Q^  ^{x)  +  -♦  oo  if  ^^(x)  +  c  <  0 

and 

a  —  a^  (pi^  j^(x)  +  -►  —  oo  if  Gi(x)  +  c  >  0  . 

Furthermore  in  each  of  these  cases  the  divergence  is  at  least  as  rapid  as  that  of  a.  Now 
suppose  ^^(x)  +  c  <  0  and  consider  what  happens  as  a  oo.  We  must  have 

+  +  ^ 0 
^  ’  '  ,  ,  (Gi  tt(*)  +  c) 

1  +  e  ’ 

since  the  denominator  diverges  exponentially.  If,  on  the  other  hand,  G2^(x)  +  c  >  0  and 
we  let  a  -*■  oo,  then 


(e<j(Gi^  ^(x)  +  c)  -  H{G^{x)  +  c))  = 


^a-a2(Gi^  Jx)  +  c) 


again  because  the  denominator  diverges  exponentially.  Therefore  we  have  shown  that 
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(ea(Gi^£,(a:)  +  c)-if(Gi(x)  +  c))  =  0 

whicli  completes  the  first  step  in  the  induction. 

Now  suppose  we  know  that 

and 

+  -  ir(Gjt(x)  +  c))  =  0. 

It  follows  that 

{Pt  +  l,a(Gt,  J^))  -Fi^  l(Gt(rr)))  = 

^k+l 

JjToo  ^  \  k  +  a(^)  -  k  +  l)-  H{Gk{x)  -  X,.  ^  ^  i))  =  0. 

i  =  1 

It  follows  easily  from  this,  in  the  manner  exhibited  in  the  k  =  1  case,  that 

+  1,  a(^)  +  ^)  -  ^i^k  +  l(^)  +  c))  =  0. 

This  completes  the  induction. 

It  follows  trivially  from 

that 

^n,  a(^)  “  Gyi{x)  .  □ 

SUMMARY  AND  FURTHER  LINES  OF  INVESTIGATION  AND  ACTIVITY 

In  summary  then,  the  problem  of  constructing  approximating  networks  in  a 
systematic  fashion  and  the  properties  of  such  approximations  were  studied.  In  the  case 
where  the  original  network  is  describable  in  terms  of  continuous  functions,  a 


10-16 


theoretically  satistfactory  approximation  can  be  constructed  using  convolutions. 
However  it  is  difficult  to  perform  computer  simulations  of  this  particular  approximation 
method.  .41ternate  methods  of  constructing  approximations  in  the  continuous  case  were 
considered.  In  the  case  where  the  network  is  describable  in  terms  of  discontinuous 
functions,  fundamental  difficulties  of  a  mathematical  nature  arise.  A  partial  solution  is 
given  to  an  important  case  of  this  last  problem. 

On  the  basis  of  what  has  been  done  so  far,  some  interesting  lines  of  inquiry 
suggest  themselves. 

1.  Software.  There  is  need  to  develop  software  for  carrying  out  computer 
simulations  of  the  concepts  put  forward  here.  A  beginning  in  this  direction  has  been 
made  by  Captain  Jeff  Fordon  of  the  U.  S.  Air  Force  in  [4].  A  graduate  student  here  at 
the  University  of  Central  Florida,  Terence  Burning,  is  presently  at  work  to  extend 
Captain  Fordon’s  work. 

2.  “Fuzzy”  networks.  Depending  on  the  success  of  1,  neural  networks  which  are 
constructed  using  functions  from  fuzzy  analysis  should  be  trained  on  simple  problems 
such  as  the  XOR  problem,  addition  base  2,  encoding  problems,  etc.  This  would  serve  as 
a  test  of  the  network  approximation  techniques  and  simultaneously  yield  some 
knowledge  of  the  capabilities  of  nonstandard  neural  networks. 

3.  The  problem  of  spurious  minima.  One  of  the  difficulties  in  applying  back- 
propagation  is  the  occurrence  of  local  minima  which  produce  false  solutions  of  the 
training  problem.  The  construction  of  neural  nets  to  approximate  given  neural  nets 
is  a  smoothing  process.  It  is  possible  that  such  constructions  might  be  used  as  a 
technique  to  get  rid  of  local  minima.  (Following  a  remark  in  [13],  base  2  addition  is  a 
candidate  for  experimentation  here.) 

Let  us  elaborate  on  this  idea.  Suppose  we  wish  to  investigate  a  neural  net  N  and 
we  construct  a  family  of  neutcd  nets  having  the  property  that  0  <  t  <  oo,  that  each 
is  a  neural  net  for  f  >  0,  and  as  t-^0.  (This  would  be  straightforward 

using  the  convolution  approximation  described  in  [14].)  We  may  arrange  things  so  that 
the  functions  describing  each  are  “smoother”  the  larger  t  is.  Then  it  is  natural  to 
ask,  will  it  be  easier  to  find  a  global  minimum  of  the  error  surface  or  performance  index 
for  which  is  “very  smooth”  than  for  an  which  is  only  “moderately  or  a  little 
smooth”?  How  will  minima  move  across  the  error  surfaces  as  t-^0?  Will  global  minima 
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transform  to  global  minima  as  t  increases? 


Terence  Burning  of  the  University  of  Central  Florida  is  currently  carrying  out 
work  to  test  the  feasibility  of  this  idea  in  a  computer  simulation. 

4.  A  fundamental  mathematical  problem  is  raised  by  this  work.  If  we  are  given 
functions  which  approximate  one  another,  say  F-  ~  Gj  for  z  =  1,  2,  •  •  •,  r,  then  when  can 
we  be  sure  that  we  will  have  o  ^  o  •  •  •  o  ~  G,.  o  o  •  •  •  o  Gj^?  Is  there  a 

way  to  construct  F^-’s  and  Gj-’s  to  assure  they  will  always  have  this  property? 
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Abstract 


A  tunable  erbium  doped  fiber  ring  laser,  pumped  with  a  980  nm  InGaAs  diode  laser 
was  constructed.  Output  power  as  a  function  of  pump  power  and  output  wavelength  for 
a  given  fiber  length  was  measured.  Spectral  and  temporal  analysis  of  the  signal 
showed  mode-locked  pulses  of  short  duration  and  broad  frequency  content,  as  well  as 
a  CW  component  confined  to  a  relatively  narrow  optical  frequency  range.  The 
mechanisms  for  this  type  of  mode-locking  are  discussed  as  well  as  the  limitations 
placed  on  the  lasing  line  width  and  spectral  tunabiltiy. 
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A  FIBER  OSCILLATOR  FOR  OPTICAL  COMMUNICATIONS 


Kenneth  J.  Teegarden 
Salahuddin  Qazi 


INTRODUCTION 

The  recent  rapid  development  and  optimization  of  optical  amplifiers  based  on 
flourescent  fibers  has  proceeded  to  the  point  that  compact,  well  engineered,  products 
are  available  from  a  number  of  companies.  Attention  has  now  turned  to  the 
development  of  oscillators  which  also  employ  erbium  doped  fibers  as  the  gain 
medium.  Flourescent  fiber  oscillators  show  great  promise  as  sources  of  short  pulses  or 
solitons  in  the  1.55  micron  telecommunications  window.  They  can  be  passively  mode- 
locked,  and  pulses  of  femtosecond  duration  have  been  reported  with  high  peak 
powers.  Most  of  the  work  so  far  has  required  high  power  pump  lasers  and  specially 
designed  amplifiers.  The  objective  of  the  work  reported  in  this  paper  was  to  determine 
if  commercially  available  amplifiers,  employing  relatively  low  power  diode  lasers  as 
pumps,  could  be  used  to  develop  oscillators  with  some  of  the  short  pulse 
characteristics  mentioned  above.  In  addition,  the  tunability  of  such  oscillators  and  their 
performance  in  wave  length  multiplexed  communication  systems  was  explored. 

EXPERIMENTAL 


The  experimental  arrangement  used  in  this  work  is  shown  in  figure  1 .  The  fiber 
oscillator  consisted  of  a  erbium  doped  fiber  amplifier  developed  by  Corning  Inc.,  and  a 
feedback  loop  assembled  from  single  mode  fiber  and  commercially  available 
components.  The  model  P3-35  amplifier  contained  approximately  20  meters  of  erbium 
doped  fiber,  as  well  as  a  thermoelectrically  cooled  semiconductor  pump  laser  which 
developed  a  maximum  of  100  mW  and  the  WDM  needed  to  combine  both  the  pump 
and  signal  in  the  the  doped  fiber.  The  amplifier  module  had  an  output  port  through 
which  the  pump  power  launched  into  the  gain  medium  could  be  monitored  as  a 
function  of  pump  current.  At  the  maximum  rated  pump  current  of  200  mA,  the  amplifier 
had  a  gain  of  28  dB  at  1550  nm.  its  optical  bandwidth  was  listed  as  40  nm.  The  output 
of  the  amplifier  was  fed  back  into  its  input  port  through  a  splitter,  which  determined  the 
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feedback  and  output  coupling  ratio,  and  an  optical  isolator.  The  characteristics  of  the 
resulting  oscillator  output  were  determined  with  either  a  optical  spectrum  analyzer,  a 
fast  detector  and  elecrical  signal  analyzer,  or  an  optical  power  meter. 


RESULTS 


At  a  given  value  of  amplifier  pump  power,  the  output  of  the  osdilator  consisted  of 
two  distinct  components.  These  were  an  unmodulated  (although  noisy)  signal 
with  a  relatively  narrow  spectrum  containing  several  temporal  modes,  and  a 
periodic  signal  consisting  of  a  series  of  pulses  with  half  widths  less  than  one 
nanosecond  associated  with  an  optical  spectrum  of  about  7.0  nm  half  width.  At 
pump  powers  well  above  the  threshold  for  oscillation,  the  output  consisted  of 
both  types  of  signal.  Below  a  certain  critical  value  of  the  pump  power  the  mode  of 
oscillation  characterized  by  the  periodic  short  pulses  existed  alone.  These 
results  are  illustrated  in  figures  2  through  8. 

The  average  output  power  of  the  oscillator  was  measured  as  a  function  of  pump 
power  for  two  different  values  of  the  output  coupler,  using  a  power  meter  with  a 
time  constant  too  slow  to  follow  rapid  temporal  fluctuations  in  the  output.  The 
result  of  these  measurements  is  shown  In  figure  2.  As  noted  above,  the  optical 
spectrum  of  the  output  changed  with  the  pump  power.  At  low  pump  power,  near 
threshold,  the  output  spectrum  was  characterized  by  the  relatively  broad 
spectrum  shown  in  figure  4,  while  at  maximum  pump  power,  oscillation  occurred 
mainly  over  a  much  narrower  spectral  range  as  shown  in  figure  6. 

The  temporal  behavior  of  the  output  of  the  oscillator  was  examined  as  a  function 
of  pump  power  using  a  detector-  amplifier  combination  vwth  a  band  width  of 
SOOMhz  and  an  electical  signal  analyzer  with  a  band  width  of  1.0  GHz.  The 
results  are  shown  in  figures  3.  and  5.  At  low  pump  powers,  corresponding  to  the 
optical  spectrum  shown  in  figure  4.,  the  output  was  found  to  contain  the  periodic 
train  of  short  pulses  shown  in  figure  3.  As  the  pump  power  was  increased,  the 
width  of  the  pulses  increased  and  a  second  series  of  pulses  developed  between 
the  components  of  the  first  series.  The  situation  for  the  maximum  pump  current  of 
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150  mA.  is  shown  in  figure  5.  The  corresponding  optical  spectrum  is  given  in 
figure  6.  A  careful  examination  of  this  optical  spectrum  revealed  that  it  consisted 
of  the  narrow  lines  shown  in  figure  6  superimposed  on  the  broad  structure 
shown  in  figure  3.  This  result  can  be  stated  In  the  following  way.  As  the  amplifier 
pump  power  is  increased,  oscillation  first  occurs  in  the  form  of  a  periodic  series 
of  short  pulses  with  a  broad  spectral  output  having  a  band  width  of  about  7.0  nm. 
The  broad  spectral  output  and  pulsed  behavior  persists  at  higher  pump  powers 
but  in  addition  a  narrow  component  to  the  optical  spectrum,  consisting  of  several 
lines  with  widths  less  than  1.0  nm,  appears. 

The  period  of  the  pulses  in  the  output  of  the  oscillator  was  found  to  be  equal  to 
the  round  trip  time  of  the  amplifier  and  its  external  feed  back  loop.  Figure  7 
shows  the  result  of  inserting  first  one,  and  then  two,  5  meter  lengths  of  single 
mode  fiber  into  the  feedback  loop.  The  first  point  on  the  curve  shown  in  figure  7 
was  obtained  by  calculating  the  length  of  fiber  in  the  amplifier  and  external  feed 
back  loop  from  a  measured  value  of  the  period  of  the  pulses,  using  a  value  of 
1.50  for  the  effective  index  of  refraction  of  the  fiber.  The  last  two  points  were 
obtained  by  using  measured  values  of  the  period  and  fiber  lengths  obtained  by 
adding  5.0  and  10.0  meters  to  the  original  length.  The  fact  that  these  three  points 
lie  on  a  straight  line  indicates  that  indeed  the  period  of  the  pulses  is  equal  to  the 
time  required  for  a  short  pulse  of  light  to  make  one  round  trip  through  the 
amplifier  and  its  feed  back  loop. 

The  width  of  the  pulses  obtained  at  low  pump  powers  was  found  to  be  limited  by 
the  band  pass  of  the  detection  system  used.  With  a  detector  band  pass  of  500 
Mhz  the  width  was  measured  to  be  2.0  ns.  When  the  detector  band  pass  was 
increased  to  a  nominal  1.0  GHz  the  result  shown  in  figure  8  was  obtained.  Here 
a  pulse  width  of  about  600  ps  is  indicated.  At  maximum  pump  power  the  pulse 
width  was  measured  to  be  10.0  ns. 

It  should  be  noted  that  the  results  reported  above  for  the  case  of  a  commercially 
available  amplifier  could  be  duplicated  using  several  different  amplifiers 
constructed  in  our  laboratory.  It  thus  appears  that  they  may  be  characteristic  of 
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any  oscillator  based  on  erbium  doped  fibers. 


CONCLUSIONS 


The  evidence  presented  above  Indicates  that  self-starting  harmonic  mode  locking 
occured  in  the  output  of  this  erbium  doped  fiber  ring  oscillator.  Although  others^ 
have  described  a  variety  of  similar  effects  obtained  with  the  use  of  non-linear 
amplifying  loop  mirrors,  no  unusual  effort  had  to  be  made  to  induce  the  mode  locking 
in  our  case.  In  fact,  the  generation  of  short  periodic  pulses  occurred  spontaneously 
over  a  wide  range  of  pump  power  and  output  power.  Because  of  instrumental 
limitations  no  conclusions  can  as  yet  be  drawn  as  to  the  exact  width  of  of  the  mode- 
locked  pulses  obtained  at  low  pump  powers,  although  they  appear  to  be  sub¬ 
nanosecond  in  duration.  The  7.0  nm  optical  band  width  of  the  pulses  in  the  low  power 
regimen  indicates  a  minimum  pulse  width  of  about  2  ps.  Since  the  power  of  the  pulses 
seems  to  be  clamped  at  some  peak  value,  their  width  must  increase  at  higher  pump 

powers,  as  observed.  A  similar  effect  has  been  observed  by  Richardson,  et.  al.^ 


The  relatively  large  spectral  line  width  associated  with  the  spontaneous  mode  locking 
has  implications  relative  to  tunabilty  and  the  use  of  such  oscillators  in  wavelength 
multiplexed  communications  systems.  In  the  absence  of  mode-locking,  the  data  given 
above  indicate  a  possible  spectral  line  width  of  less  than  one  nanometer.  Mode¬ 
locking  widens  the  output  spectrum  at  its  base  to  7.0  nm.  Obviously  this  increased  line 
width  will  lead  to  cross  talk  and  a  sizable  reduction  in  the  number  of  channels 
available  in  a  multiplexed  system. 

Finally,  the  mechanism  which  spontaneously  couples  modes  at  low  pump  powers  is 
not  clearly  understood.  It  should  be  remembered,  however  that  erbium  represents  a 
three  level  laser  system  in  which  both  gain  and  absorption  loss  can  be  present 
simultaneously.  In  these  longitudinally  pumped  oscillators,  part  of  the  fiber  may  be 
operating  as  a  saturable  absorber  while  the  rest  acts  as  the  gain  medium.  It  would  thus 
be  interesting  to  determine  if  similar  results  are  obtained  in  a  four  level  gain  medium, 
such  as  neodymium  doped  fiber. 
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SPECTRUM 

ANALYSER 


Figure  1 .  Experimental  setup  of  fiber  oscillator. 


Pump  Power  (art)  units) 


Figure  2.  Oscillator  output  power  as  a  function  of  pump  power. 
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Figure  3.  Output  waveform  of  oscillator  at  low  pump  power. 
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Cavtty  Length  (Meters) 


Figure  7.  Period  of  mode-locked  pulses  as  a  function  of  cavity  length. 
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Abstract 

This  document  reports  on  the  results  of  a  computer  simulation  analysis  of  a  projected 
micro-computer  network  in  the  XP  directorate  at  Rome  Laboratories.  The  purposes  of 
the  simulation  study  were:  (a)  to  effectively  model  the  proposed  Apple  Talk  network  in 
XP  via  a  computer  simulation  model,  (b)  to  perform  ’‘what  if”  tests  with  the  model  in 
order  to  make  recommendations  for  future  additions  to  the  network,  and  (c)  to  conduct 
preliminary  tests  on  the  effectiveness  of  various  advanced  simulation  variance  reduction 
techniques  on  the  network  model  in  order  to  form  a  baseline  for  future  work  in  integrating 
numerous  simulation  models  already  in  use  at  Rome  Laboratories.  The  results  of  this 
study  are:  (a)  a  validated  and  accurate  computer  simulation  model  of  the  proposed  Apple 
Talk  network  in  XP  at  Rome  Laboratories  that  includes  a  fully  documented  prescription 
of  statistical  analysis  procedures  for  estimating  the  performance  measures  of  interest  from 
this  model,  (b)  a  preliminary  study  of  the  important  “what  if”  scenarios  to  determine 
their  relative  value,  and  (c)  a  preliminary  manual  for  implementing  important  variance 
reduction  techniques  with  this  model.  The  first  two  results  provide  XP  with  a  good 
understanding  of  the  operating  characteristics  of  the  Apple  Talk  network  under  a  number 
of  scenarios  of  interest  and  will  provide  them  with  the  knowledge  to  determine  how  to 
develop  the  network  in  the  future.  The  last  result  will  lay  a  solid  foundation  for  future 
work  in  integrating  the  simulation  models  at  Rome  Laboratories. 
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Simulation  Analysis  of  a  Local  Area  Network  for  XP 
at  Rome  Laboratories 

Jeffrey  D.  Tew 


1.  Introduction 

This  document  is  a  report  of  a  computer  simulation  analysis  of  a  projected  micro¬ 
computer  network  in  the  XP  directorate  at  Rome  Laboratories.  There  were  three  main 
purposes  for  conducting  this  study:  (a)  to  effectively  model  the  proposed  installed  Apple 
Talk  network  in  XP  via  a  computer  simulation  model,  (b)  to  perform  “what  if”  tests 
with  the  model  in  order  to  make  recommendations  for  future  additions  to  the  network, 
and  (c)  to  conduct  preliminary  tests  on  the  effectiveness  of  various  advanced  simulation 
variance  reduction  techniques  on  the  network  model  in  order  to  form  a  baseline  for  future 
work  in  integrating  numerous  simulation  models  already  in  use  at  Rome  Laboratories. 

The  construction  of  an  effective  computer  simulation  model  of  the  Apple  Talk  net¬ 
work  involved:  (a)  conducting  an  input  modelling  study  of  the  random  components  in  the 
model,  (b)  verification  that  the  computer  code  which  comprises  the  model  is  correct,  (c) 
validating  the  performance  of  the  model  against  the  performance  of  the  real  network,  and 
(d)  constructing  accurate  estimators  of  the  desired  performance  measure  characteristics. 

The  process  of  conducting  preliminary  tests  on  the  effectiveness  of  various  variance 
reduction  techniques  on  this  model  involved  conducting  numerous  Monte  Carlo  compar- 
ision  studies  with  the  model  in  order  to  measure  the  percent  variance  reduction  achieved 
with  this  model  for  each  of  the  performance  measures  of  interest  and  for  each  of  the 
variance  reduction  techniques  considered.  This  study  focused  on  the  variance  reduction 
techniques  of:  (a)  common  random  numbers,  (b)  antithetic  variates,  and  (c)  control 
variates. 
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Together,  these  three  phases  of  the  study  will  yielded:  (a)  fully  validated  and 
accurate  computer  simulation  model  of  the  existing  Apple  Talk  network  in  XP  at  Rome 
Laboratories  that  includes  a  fully  documented  prescription  of  statistical  analysis  proce¬ 
dures  for  estimating  the  performance  measures  of  interest  from  this  model,  (b)  an  study 
of  the  important  “what  if”  scenarios  to  determine  their  relative  value,  and  (c)  a  prelimi¬ 
nary  manual  for  implementing  important  variance  reduction  techniques  with  this  model. 
The  last  result  will  lay  a  solid  foundation  for  future  work  in  integrating  the  simulation 
models  at  Rome  Laboratories. 

In  the  next  section,  we  give  a  brief  description  of  the  existing  Apple  Talk  network  in 
XP.  In  Section  3  we  outline  the  development  of  the  computer  simulation  model  used  in 
this  study.  Section  4  gives  a  brief  discussion  of  how  the  “what  if”  tests  were  be  performed 
as  well  as  a  discussion  and  analysis  of  the  numerical  results.  Section  5  gives  a  detailed 
development  of  the  variance  reduction  techniques  that  were  developed  as  part  of  this 
project. 


2.  XP  Computer  Network 

The  XP  directorate  at  Rome  Laboratories  is  commissioned  with  the  task  of  providing 
Rome  Laboratories  with  cross-directorate  strategic  planning  and  technical  program  sup¬ 
port.  A  critical  component  which  enables  the  XP  personnel  to  effectively  carry-out  that 
task  is  the  ability  to  efficiently  communicate  on  a  technical  software  basis  with  one  an¬ 
other  and  with  those  located  in  other  directorates  on  base  as  well  as  other  facilities 
throughout  the  world. 

Currently,  it  is  felt  that  the  level  this  type  of  communication  made  available  to  XP 
personnel  is  inadequate.  That  is  to  say,  that,  increasingly,  the  XP  personnel  are  finding 
it  more  difficult  to  communicate  on  a  technical  software  basis  with  their  colleagues.  Con¬ 
sequently,  it  has  been  felt  that  improved  computer  communications  capabilities  among 
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the  XP  personnel  would  solve  problem  and  bring  about  an  immediate  improvement  in 
the  level  of  technical  software  interfaceing  among  the  XP  personnel.  To  that  end,  the 
director  of  XP  has  initiated  the  installation  of  a  30  workstation  Machintosh  network 
operating  under  the  Apple  Talk  protocal  environment.  This  initial  installation  process 
began  in  August  1991  and  was  expected  to  be  completed  by  1  December  1991  (the  net¬ 
work  is  unoperational  at  this  writing).  The  installation  process  has  involved  5  major 
steps  indicated  below: 

1.  Physical  installation  and  connection 

2.  Setup  of  Macintosh  equipment 

3.  Upgrade  and  setup  of  printing  resources 

4.  Setup  of  DOS  equipment 

5.  Connection  to  external  networks 

Figure  1  gives  a  diagramatic  representation  of  the  network.  The  network  config¬ 
uration  is  basically  that  of  a  backbone  network  with  nodes  connecting  directly  to  the 
backbone.  Traffic  handling  on  the  backbone  is  to  be  done  by  a  central  server  called  a 
Star  Controller.  Nodes  on  the  network  are  of  two  types:  (a)  Macintosh  machines  and 
(b)  DOS  machines.  Initially,  most  nodes  will  be  Macintosh  type  machines  (typically, 
Macintosh  Ilci  with  4  Mb  RAM).  Although,  provision  will  be  made  for  interfacing  with 
DOS-based  machines,  it  is  anticipated  that  most  nodes  in  the  future  will  continue  to  be 
Macintosh  type  machines.  Traffic  on  the  network  will  be  of  three  main  types:  (a)  data, 
(b)  video,  and  (c)  acknowledgement. 

The  Star  Controller  will  also  serve  as  a  node  on  the  Rome  Laboratories  interdi¬ 
rectorate  LONEX  computer  network  which  in  itself  serves  as  a  node  on  the  Internet 
network. 
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NETWORK  TOPOLOGY 


Figure  1(a):  XP  Macintosh  Network 
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NETWORK  TOPOLOGY 


Figure  1(b):  XP  M 
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3.  Simulation  Model 


The  actual  construction  of  the  simulation  model  was  carried  out  by  performing  the  fol¬ 
lowing  eight  steps: 

1.  Problem  identification 

2.  Specification  of  objectives 

3.  Definition  of  the  system 

4.  Validation  of  objectives  and  system  definition 

5.  Model  formulation 

6.  Model  verification 

7.  Model  validation 

8.  Model  implementation 

These  eight  steps  are  often  recognized  as  standard  means  for  building  effective  simu¬ 
lation  models  in  the  literature.  Chapter  5  of  Law  and  Kelton  (1990),  Chapter  3  of  Pritsker 
(1986),  and  Chapters  1  and  4  of  Bratley,  Fox,  and  Schrage  (1987)  all  offer  state-of-the-art 
discussions  and  the  topic  of  effective  simulation  model  building.  Basically,  none  of  them 
differ  by  much  from  the  presribed  approach  given  above.  Next  we  give  a  brief  discussion 
of  the  implementation  of  each  of  these  eight  steps  in  the  development  of  the  simulation 
model. 

1.  Problem  Identification:  This  step  of  the  process  involved  extensive  discussions 
with  XP  personnel  in  order  to  determine  the  actual  structure  of  the  network  as 
well  as  the  determination  of  which  performance  measure  and  input  factors  were 
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of  most  interest.  Through  these  discussions,  we  were  able  to  determine  that:  (a) 
the  mean  time  in  the  network  for  all  packets  was  the  performance  measure  of  most 
interest  (y),  (b)  the  mean  think  time  between  user  commands  on  the  network  (xi) 
and  the  number  of  CPU  units  at  the  Star  Controller  (X2)  were  the  two  input  factors 
of  most  interest,  and  (c)  one  hour’s  operation  during  a  peak  user’s  period  was  the 
time  frame  of  interest. 

2.  Specification  of  Objectives:  In  this  phase  we  determined  that  the  mean  packet 
time  in  the  network  was  of  most  interest. 

3.  Definition  of  System:  The  logical  flow  of  the  packets  was  determined  to  be  the 
following:  packets  are  generated  (created)  at  each  user  on  the  network.  There  are 
two  kinds:  (a)  those  that  are  to  be  routed  to  another  user  on  the  network  and 
(b)  those  that  are  to  be  sent  to  the  Star  Controller.  For  purposes  of  this  initial 
model  it  was  determined  that  packet  traffic  leaving  the  network  and  coming  into 
the  network  from  other  networks  would  not  be  included  in  the  model. 

4.  Model  Formulation:  As  indicated  above,  the  simulation  model  was  coded  in 
the  SLAM  II  simulation  software.  A  description  of  this  code  is  given  later  in  this 
section.  Since  the  system  was  not  operational  at  the  time  of  the  simulation  model 
development,  we  decided  to  use  traffic  composition  distributions  and  interarrival 
time  distributions  provided  in  the  literature. 

5.  Model  Verification:  Upon  completion  of  the  model  code,  we  tested  the  model 
under  different  scenarios  to  determine  if  the  code  was  performing  correctly. 

6.  Model  Validation:  Again,  since  the  network  was  not  operational  at  the  time  of 
the  development  of  the  simulation  model,  we  used  the  face  validity  of  the  model 
outputs  as  determined  by  some  of  the  personnel  at  XP. 
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7.  Model  Implementation;  This  involved  two  primary  parts:  (a)  selection  and 
testing  of  appropriate  statistical  estimation  methodologies  for  each  performance 
measure  of  interest  and  (b)  establishing  tactical  simulation  run  characteristics  (that 
is,  how  long  is  each  simulation  run  to  be  and  how  do  we  start  each  model  run).  The 
former  is  discussed  in  detail  in  Section  4.  Consideration  of  the  latter  part  resulted 
in  determining  that  a  simulation  run  was  to  be  for  one  hour  of  simulation  time 
with  the  model  starting  empty  and  idle  and  with  a  thirty  minute  burn-in  period  to 
control  for  initialization  bias. 

Next,  we  discuss  the  SLAM  II  code  of  the  simulation  model. 

A  copy  of  the  SLAM  II  code  for  the  simulation  model  of  the  XP  micro-computer 
network  is  given  in  Appendix  1.  This  model  was  developed  using  the  Network  modeling 
component  in  SLAM  II  (see  Chapters  5,  6,  and  7  of  Pritsker  1986).  We  used  27  separate 
CREATE  nodes  to  model  the  27  nodes  on  the  network  (see  Figure  1).  Each  used  an 
Exponential  distribution  to  model  the  time  between  arrivals  (creations)  of  messages  to 
the  network.  Upon  creation,  a  number  from  a  the  Poisson  distribution  with  mean  11  is 
generated  and  represents  the  number  of  frames  in  that  particular  packet.  Each  frame 
consists  of  1526  bytes  of  information.  Each  message  is  sent  immediately  to  the  Star 
Controller  for  processing. 

At  the  Star  Controller  messages  are  either:  (a)  processed  and  a  return  set  of  messages 
is  sent  back  to  the  point  of  origination  or  (b)  the  newly  created  message  is  routed  to  its 
destination.  Thus,  we  have  two  basic  types  of  messages  that  are  created  at  each  of  the 
27  users  (nodes)  on  the  network:  (a)  those  that  are  to  be  sent  to  other  users  on  the 
network  and  (b)  those  that  are  to  be  sent  to  the  Star  Controller  for  processing.  25%  of 
the  created  messages  are  to  go  to  the  Star  Controller  and  75%  are  to  be  routed  to  one  of 
the  other  26  nodes.  In  our  model,  we  used  a  discrete  uniform  distribution  to  determine 
the  destination  of  a  message.  Since  the  XP  network  was  not  operational  during  the  time 
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of  this  study,  all  distributions  used  in  this  study  were  taken  from  the  existing  literature 
on  successful  simulation  studies  of  similar  Local  Area  Netorks  (LAN)  (see  Mills  1988; 
Devai,  Kerrigan,  and  Molloy  1990;  and  Kavi,  Frost,  and  Shanmugan  1991).  Waiting  for 
processing  at  the  Star  Controller  is  done  in  a  single  queue  until  the  message  is  processed 
and  sent  back  onto  the  network  to  its  point  of  termination  (note,  all  transmission  times 
to  and  from  the  Star  Controller  were  taken  to  be  .00003  seconds).  Message  processing 
time  at  the  Star  Controller  is  .00003  times  the  total  number  of  bytes  in  the  message  and 
waiting  in  queue  is  done  on  a  FIFO  queue  priority  basis. 

Upon  reaching  its  point  of  termination,  statistics  on  the  time  spent  in  the  network 
and  the  total  number  of  bytes  contained  in  the  message  are  collected  at  each  termination 
nodel  as  well  as  accumulated  across  all  termination  nodes.  In  addition,  the  standard 
set  of  statistical  outputs  are  generated  by  SLAM  II  Summary  Report  for  this  model. 
An  example  of  such  an  output  report  is  given  in  Appendix  11.  Among  other  things, 
this  report  gives  statistics  on  the  average  queue  length  at  the  Star  Controller  queue, 
the  average  utilization  of  the  Star  Controller  processor,  and  the  total  number  of  bytes 
processed  ,  among  others.  In  the  next  section,  we  discuss  the  experimentation  that  was 
done  with  this  simulation  model. 


4.  Simulation  Experiments 

Once  the  simulation  model  was  developed,  a  number  of  experiments  were  performed. 
Two  of  these  are  summarized  below.  The  personnel  at  XP  were  interested  in:  (a)  the 
mean  time  spent  in  the  network  by  a  message  (j/),  (b)  the  mean  interarrival,  in  seconds, 
(creation)  time  of  messages  at  each  of  the  27  user  nodes  on  the  network  (xi),  and  (c) 
the  number  of  Star  Controller  CPU’s  (0:2).  The  first  of  these  is  the  response  variable  of 
interest  and  the  latter  two  are  the  input  factors  of  interest. 

First,  we  considered  the  relationship  between  y  and  x^.  For  Xi,  4  different  levels  were 


12-13 


chosen  that,  together,  represent  the  design  space  of  interest  to  XP.  The  levels  chosen  were 
xi  =  5,  8, 11, 14  seconds.  For  each  level  of  Xj  a  set  of  5  simulation  runs  was  made.  The 
statistical  software  package  SAS  was  used  to  perform  a  regression  analysis  on  the  data  in 
order  to  ascertain  whether  a  second-order  model  adequately  described  the  relationship 
between  E[y]  and  Xj.  Specifically,  the  SAS  procedure  PROC  RSREG  (see  the  SAS 
manual)  was  used  to  carry  out  the  calculations.  The  results  of  this  process  are  given  in 
Figure  2. 

From  these  results  we  see  that  a  second-order  model  does  describe  the  desired  re¬ 
lationship  well  (the  lack-of-fit  test  for  the  second-order  model  has  a  p-value  of,  approxi¬ 
mately,  0.0000).  Thus,  we  concluded  that  the  relationship  between  E[y]  and  Xi  is  given 
as: 

E[y]  =  4354.54  -  476.14xi  +  U.OTx^ 

From  this  fitted  quadratic  model,  a  canonical  analysis  was  also  performed  to  determine 
that  the  minimum  response  (y)  was  attained  at  the  point  x\  =  16.92  and  that  it  resulted 
in  a  minimum  response  value  of  y^rin  ~  326.95  seconds. 

Second,  we  considered  the  relationship  between  y  and  X2.  For  X2,  again,  4  different 
levels  were  chosen  that,  together,  represent  the  design  space  of  interest  to  XP.  The  levels 
chosen  were  X2  =  1,2, 3,  4  seconds.  For  each  level  of  X2  a  set  of  5  simulation  runs  was 
made.  The  statistical  software  package  SAS  was  used  to  perform  a  regression  analysis 
on  the  data  in  order  to  ascertain  whether  a  second-order  model  adequately  described  the 
relationship  between  E[y]  and  X2.  Specifically,  the  SAS  procedure  PROC  RSREG  (see 
the  SAS  manual)  was  used  to  carry  out  the  calculations.  The  results  of  this  process  are 
given  in  Figure  3. 

As  before,  from  these  results  we  see  that  a  second-order  model  does  describe  the 
desired  relationship  well  (the  lack-of-fit  test  for  the  second-order  model  has  a  p-value  of, 
approximately,  0.0000).  Thus,  we  concluded  that  the  relationship  between  E[y]  and  Xj 
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is  given  as: 

E[y\  =  4441.83  -  2444.37x2  +  331.93x2 

From  this  fitted  quadratic  model,  a  canonical  analysis  was  also  performed  to  determine 
that  the  minimum  response  (y)  was  attained  at  the  point  X\  =  3.68  and  that  it  resulted 
in  a  minimum  response  value  of  =  —58.28  seconds;  which,  of  course,  is  not  feasible. 
However,  clearly  a  quadratic  model  of  this  relationship  is  appropriate  and  that  an  addition 
of  one  extra  Star  Controller  will  significantly  reduce  the  mean  time  that  a  message  spends 
in  the  system. 

These  two  regression  models,  and  others  like  them  constructed  from  the  existing 
SLAM  II  simulation  code,  can  be  used  to  guide  XP  in  designing  future  additions  to  the 
network.  Based  on  the  results  of  presented  here  and  under  the  current  message  traffic 
patterns  on  the  network,  we  would  recommend  that  at  least  two  Star  Controller  CPU 
units  be  installed. 

For  each  regression  model  described  above,  the  variance  reduction  technique  enti¬ 
tled  Combined  Method  III  was  used  to  effecively  reduce  the  variance  of  the  estimated 
regression  coefficients.  The  observed  variance  reduction  was  on  the  order  of  at  least  76% 
in  all  cases.  A  full  discussion  of  the  variance  reduction  techniques  developed  as  a  part  of 
this  project  is  given  in  the  next  section. 


5.  Variance  Reduction  Techniques 

In  this  section,  we  describe  the  variance  reduction  methodologies  that  were  developed 
in  order  to  improve  the  estimation  of  the  regression  model  coefficients  presented  in  the 
previous  section. 
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MEAN  TIME  IN  SYSTEM  VERSUS  HEAII  IHTERARRIVAL  TIME 
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Figure  2(a):  SAS  Results 
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MEAN  TIME  IN  SYSTEM  VERSUS  MEAN  INTERARRIVAL  TIME 
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Figure  2(b):  SAS  Results 
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MEAN  TIME  IN  SYSTEM  VERSUS  NUMBER  OF  CPU  UNITS 
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Figure  3(a):  SAS  Results 
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Figure  3(b):  SAS  Results 
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5.1  Notation 


In  a  designed  simulation  experiment,  the  analyst  is  usually  concerned  with  estimating  the 
mean  of  a  univariate  response  of  interest  based  on  the  outputs  of  a  carefully  structured 
set  of  runs.  Frequently,  large-scale  simulation  studies  require  extensive  experimentation 
to  obtain  acceptable  precision  in  the  estimator  of  interest.  If  we  can  reduce  the  variance 
of  this  estimator  at  little  additional  cost,  then  we  can  also  reduce  the  requisite  amount  of 
experimentation.  In  this  section  we  propose  three  ways  of  combining  the  two  standard 
variance  reduction  techniques  of  antithetic  variates  and  control  variates  to  yield  more 
precise  estimators  of  the  mean  response. 

For  a  single-model  simulation  experiment  (i.e.,  one  system  configuration  or  design 
point),  the  methods  of  antithetic  variates  and  control  variates  are  probably  among  the 
most  commonly  applied  variance  reduction  techniques  (Law  and  Kelton  1991,  Chap¬ 
ter  11).  The  method  of  antithetic  variates  assigns  complementary  streams  of  random 
numbers  to  pairs  of  simulation  runs  taken  at  a  single  design  point  to  induce  a  negative 
correlation  between  the  corresponding  responses.  Let  y\  and  ?/2  denote  two  responses 
obtained  by  antithetic  replicates  of  a  single  design  point.  Suppose  that  we  estimate 
fly  =  E(y,)  {i  =  1,2)  by  the  sample  mean  response  -f  2/2)-  Then  we  observe  that  in 
general, 

var[^(yi  -|-  2/2)]  =  ^var(2/i)  -|-  ^var(2/2)  +  -cov(2/i, 2/2)  =  +  ^'^<=>v{yi,y2). 

In  this  equation,  if  the  covariance  between  r/i  and  1/2  obtained  by  antithetic  replicates 
is  negative,  then  the  variance  of  the  sample  mean  is  less  than  that  obtained  by  two 
independent  replicates  (for  a  more  detailed  discussion  of  antithetic  variates  see  Section 
IIL6  of  Kleijnen  1974,  Section  2.2  of  Brat  ley.  Fox,  and  Schrage  1987,  and  Section  11.3  of 
Law  and  Kelton  1991). 

In  contrast  to  the  approach  of  antithetic  variates,  the  method  of  control  variates 
attempts  to  exploit  intrinsic  correlations  between  the  target  response  and  selected  auxil- 
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iary  outputs  (control  variates)  when  all  of  these  quantities  are  generated  within  a  single 
run.  Let  j/,-  and  c,  respectively  denote  the  response  of  interest  and  the  s  x  1  vector  of 
control  variates  obtained  from  the  ith  simulation  run.  We  assume  that  (z  =  1,2, . . .  ,2h): 


where  is  the  unconditional  variance  of  yi,  cr yc  is  the  s  x  1  covariance  vector  between 
yi  and  c,-,  and  Sc  is  the  s  x  s  covariance  matrix  of  c,-.  Also,  without  loss  of  generality,  we 
assume  E(c,)  =  O(ixi)-  In  the  context  of  performing  2h  independent  replications  of  the 
simulation,  joint  normality  of  the  response  and  the  controls  ensures  that  the  response 
can  be  represented  by  the  following  linear  model: 

y  = /rj,l(2/ixi)  +  Ca  +  e,  (2) 

where  y  =  {yi^y^,.. .  ,y2h)\  l(2/ixi)  is  a  2/i  x  1  vector  of  I’s,  C  is  a  2/i  x  s  control  variate 
matrix  whose  zth  row  consists  of  c/,  fiy  is  the  mean  response,  a  =  Sc^tr^c  is  the  s  x  1 
vector  of  control  coefficients,  and  e  is  the  2h  x  1  vector  of  error  terms  (see  Lavenberg, 
Moeller,  and  Welch  1982).  The  least  squares  estimators  of  a  and  y,y  in  the  linear  model 
(2)  are  given  by,  respectively: 

d  =  (C'PC)-‘C'Py  and  =  y  -  c'd,  (3) 

where  y  =  {2h)-^l\,^u)y  =  (2/z)-‘ E'=,  1/.  and  c'  =  (2/z)-M'(ix2Z.)C  =  (2/i)-i  c/ 
respectively  denote  the  sample  mean  of  the  responses  and  the  control  vectors  com¬ 
puted  across  2h  replications,  and  P  =  I(2/ix2/i)  —  ^l(2/ixi)lVx2/i)  (s®®  Searle  1971,  p. 
341).  Under  the  normality  assumption  in  (1),  the  components  of  the  error  vector  e  are 
IID  ^V(0,cry|p)  where 

<^y|c  =  var(j/,|c,)  =  (crj  -  o-'yc^c  ^<^yc)  (4) 
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(see  Theorem  2.5.1  of  Anderson  1984);  and  the  least  squares  estimator  p,y  is  an  unbiased 
estimator  for  fiy.  Lavenberg,  Moeller,  and  Welch  (1982)  showed  that  the  unconditional 
variance  of  fXy  is  given  by 

™r(Av)=  (5) 

where  is  the  square  of  the  multiple  correlation  coefficient  between 

y,-  and  c,-.  Lavenberg,  Moeller,  and  Welch  also  defined  the  quantity  2h-s-2 
factor  due  to  the  estimation  of  the  unknown  control  coefficient  vector  a  in  (2);  and  they 
identified  (1  —  R^c)  as  the  minimum  variance  ratio  which  represents  the  potential  for 
reducing  the  variance  of  the  estimator  of  Hy  by  the  control  variates.  Thus,  the  efficiency 
of  control  variates  is  measured  by  the  product  of  the  loss  factor  and  the  minimum  variance 
ratio. 

In  this  research,  our  main  interest  is  to  obtain  a  more  precise  estimator  of  the  mean 
response  fiy  by  combining  the  variance  reduction  techniques  of  antithetic  variates  and 
control  variates.  These  two  methods  utilize  correlations  between  simulation  outputs  that 
are  generated  either  from  a  single  run  or  from  different  runs  in  a  designed  simulation 
experiment.  Suppose  that  through  correlated  replications  of  a  given  simulation  model, 
we  get  a  reduced  variance  for  the  estimator  of  the  mean  response  and  yet  we  maintain  the 
same  correlation  between  the  response  y,  and  the  control  vector  c,  as  we  obtained  with 
independent  replications;  then  it  is  conjectured  that  we  may  apply  the  methods  of  anti¬ 
thetic  variates  and  control  variates  jointly,  thereby  obtaining  greater  efficiency  increases 
than  could  be  obtained  by  applying  either  antithetic  variates  or  control  variates  sepa¬ 
rately.  (Note  that  Avramidis  and  Wilson  (1992)  establish  general  conditions  for  effective 
combination  of  the  methods  of  control  variates,  correlation  induction,  and  conditional 
expectation  for  finite- horizon  simulation  experiments.) 

Based  on  this  conjecture,  this  research  focuses  on  developing  methods  for  combining 
antithetic  variates  and  control  variates  for  the  estimation  of  the  mean  univariate  simula- 
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tion  response.  For  this  purpose,  we  consider  three  ways  of  utilizing  correlations  between: 
(a)  multiple  replications  of  the  response  of  interest,  (b)  the  response  and  a  set  of  control 
variates  observed  on  a  single  replication,  and  (c)  multiple  replications  of  the  control  vari¬ 
ates.  Some  of  these  correlations  are  induced  by  an  appropriate  assignment  of  random 
number  streams  to  different  replications;  and  by  using  such  correlation-induction  strate¬ 
gies,  we  try  to  improve  upon  the  simulation  efficiency  of  the  classical  control  variates 
method  in  which  all  replications  are  independent.  The  three  combined  methods  differ 
only  in  which  elements  of  the  model  that  the  method  of  antithetic  variates  is  applied 
to.  There  are  three  obvious  choices:  (a)  use  control  variates  in  conjuction  with  anti¬ 
thetic  variates,  where  the  method  of  antithetic  variates  is  only  applied  to  the  response 
y,  (b)  use  control  variates  in  conjuction  with  antithetic  variates,  where  the  method  of 
antithetic  variates  is  applied  to  just  the  control  variates,  and  (c)  use  control  variates  in 
conjuction  with  antithetic  variates,  where  the  method  of  antithetic  variates  is  applied 
to  both  the  response  y  and  the  control  variates.  These  constitute  the  three  combined 
methods  discussed  in  Section  2. 

In  the  next  subsection  we  develop  the  three  combined  methods  for  jointly  applying 
control  variates  and  antithetic  variates. 


5.2  Simulation  Efficiency  of  Combined  Methods 

In  computer  simulation,  random  number  streams  that  drive  a  simulation  model  are  under 
the  control  of  the  experimenter  and  completely  determine  the  simulation  output.  Let  the 
random  number  stream  r,j  =  . . .)'  denote  the  potentially  infinite  sequence  of 

random  numbers  used  to  drive  the  jth  random  component  in  the  simulation  model  at  the 
fth  replicate,  where  {r,j/  :  I  =  1,2, . . .}  are  IID  (7(0, 1)  variates.  The  random  components 
in  a  queueing  simulation  model  may  include,  among  other  things,  the  sequence  of  service 
times  sampled  at  a  particular  service  center,  the  sequence  of  interarrival  times  sampled 
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from  a  given  arrival  process  to  a  network,  etc.  Here,  we  are  saying  that,  to  each  random 
component  in  the  simulation  model,  we  assign  a  separate  random  number  stream  to 
generate  realizations  of  the  corresponding  stochastic  simulation  input  process.  We  assume 
that  g  random  number  streams  are  required  to  drive  the  entire  simulation  model,  and  we 
let  R,  denote  the  complete  set  of  streams  used  for  the  ith  replication  of  the  model: 

R,-  ==  (r,i,  r,2, . . . ,  r.J  for  i  =  1, 2, . . . ,  2h. 

We  now  consider  the  random  number  assignment  strategy  of  jointly  utilizing  antithetic 
variates  and  control  variates  to  estimate  the  mean  response  gy.  To  this  end,  we  separate 
R,  into  two  mutually  exclusive  and  exhaustive  subsets  of  random  number  streams,  such 
that  R,  s  (R,i,R,2)  {i  =  1,2,...,2A).  The  first  subset,  R,i,  consisting  of  [g  —  5*) 
random  number  streams,  is  used  to  drive  the  non-control-variate  random  components 
in  the  model  so  that  c,  is  independent  of  R,i.  The  second  subset,  R,2,  consisting  of  s* 
random  number  streams,  is  used  to  drive  the  control-variate  random  components  in  the 
model  so  that  c,  is  a  function  of  R,'2.  These  properties  are  summarized  as  follows: 

c,-,  R.i  are  independent  and  c,  =  c,(R,2)  for  z  =  1, 2, . . . ,  2/i. 

Applying  the  method  of  antithetic  variates  to  the  appropriate  random  components  in  the 
simulation  model  may  induce  correlations  between:  (a)  responses,  (b)  control  variates, 
and  (c)  responses  and  control  variates,  across  replicates. 

Specifically,  we  consider  the  following  methods:  (a)  use  antithetic  variates  for  all 
random  components  except  the  control  variates,  (b)  use  antithetic  variates  on  only  the 
control  variates,  and  (c)  use  antithetic  variates  for  all  random  components.  Through 
statistical  analysis  and  simulation  experimentation,  we  will  explore  how  these  methods 
may  improve  the  simulation  efficiency  in  reducing  the  variance  of  the  estimator,  and 
what  conditions  are  necessary  for  each  method  to  ensure  an  improvement  in  variance 
reduction. 
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5.2.1  Combined  Method  I 


This  section  presents  a  method  for  combining  antithetic  variates  and  control  variates 
based  on  correlated  replicates  utilizing  only  the  non-control-variate  random  components 
in  the  model.  Recall  from  the  discussion  given  in  the  Introduction,  the  basic  idea  of 
this  method  (as  well  as  the  methods  presented  in  the  next  two  sections)  is  to  group  the 
replicates  into  h  antithetic  pairs.  Within  the  jth  pair  of  replicates,  Combined  Method 
I  uses  (R2j-i,i)  R-2i-i,2)  as  the  input  to  run  2j  —  1  and  (R2j_i,i,  R2j,2)  as  the  input 
to  run  2j,  where  R2j_i,i,  R22-i,2,  and  R2j,2  are  mutually  independent  sets  of  random 
number  streams,  and  R2j_i,i  is  the  set  of  random  number  streams  that  are  antithetic 
(complimentary)  to  those  which  comprise  R2_,_i,i  {j  =  1,2,...,  A):  R2j_i,i  =  {r2j-i,h  ■ 
k  —  1,2,. . .  ,g  —  s*),  where  denotes  the  random  number  stream  complimentary  to 

T2j—i^k  SO  that 


^2j-l,k 


1  - 

(  1  \ 
1  ~  ^2j-l.k,2 
\  / 


for  k  =  1,2, ...  ,g 


s 


Across  pairs  of  replicates,  this  method  uses  independent  random  number  streams.  Thus, 
the  jth  pair  of  responses,  i/2j-i  and  ?/2j  (i  =  C  2, . . . ,  h),  are  negatively  correlated  by  the 
use  of  antithetic  streams  on  the  non- control- variate  random  components.  However,  across 
all  2h  replicates,  the  control  variates  c,  {i  =  1,2, . . .  ,2h)  are  independently  generated 
by  the  assignment  of  randomly  selected  random  number  streams,  R,-,2,  used  to  drive  the 
control-variate  random  components  for  each  replicate.  Because  we  randomly  select  the 
random  number  streams  to  drive  the  control  variates,  the  response  y,  {i  =  1,2, .. .  ,2h) 
is  independent  of  the  control  variate  vector  c*,  {k  =  1,2,. . .  ,2/i  and  k  ^  i).  Based  on 
the  above  discussion,  we  establish  the  following  properties  (i  =  1,2, . . .  ,2h): 
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Property  1:  homogeneity  of  response  variances  across  replicates, 

var(t/,)  =  r  =  1,2,. . .  ,2/i.  (6) 

Property  2;  homogeneity  of  negative  correlations  across  replicate  pairs  and  indepen¬ 
dence  of  responses  between  replicate  pairs, 

cov{yk,yt)  =  -picrl  for  ^  =  /  -|-  1  (/  =  1, 3, . . . ,  2/1  -  1); 
cov(yk,yi)  =  0  otherwise. 

Property  3;  homogeneity  of  response-control  variates  covariances  across  replicates  and 
independence  of  response-control  variates  covariances  between  replicates, 

Cov(j/.,  c,)  =  <Tyc,  f  =  1,2,. . .  ,2/i; 

Cov{yi,Ck)  =  O(sxi),  for  i  7^  k. 

Property  4:  homogeneity  of  control  variates  covariances  across  replicates  and  inde¬ 
pendence  of  control  variates  between  replicates, 

Cov(c,)  =  =  1,2, . . . ,  2/i; 

Cov(c,-,ca:)  =  O(sxs)  for  i  7^  k. 

We  make  the  following  assumption 

Assumption  ;  0  <  pi  <  1. 

Under  the  four  properties  listed  above,  the  variance  of  the  mean  response,  =  |(y2j-i  + 
y2j),  and  mean  control  variate  vector,  Cj  =  j(c2j-i  -f  C2j),  within  the  ;th  replicate  pair 
{j  =  1, 2, . . . ,  /i)  are  respectively  given  by: 

var(y_,)  =  ^(var(y2j-i)  +  var(y2j)  +  2cov(y2j-i,  J/2j))  = 

and 

Cov(Cj)  =  — (Cov(c2j-i)  -f  Cov(c2j)  -1-  2Cov(c2j_i,  C2j))  =  2^*^' 
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Also,  the  covariance  matrix  between  yj  and  Cj  is  given  by; 

Cov(Sj,Cj)  =  icov{!/j,_i  +  !/2j,C2y-i  +  Cj,)  = 


(12) 


The  joint  normality  assumption  of  the  response  and  control  variates  gives  the  joint  dis¬ 
tribution  of  yj  and  as  follows; 


" 

/ 

" 

1 

yj 

Py 

.  . 

V 

0{iXl) 

’2 

(1  - 

CfyC 


(13) 


Consequently,  yj,  given  Cj,  is  normally  distributed  with  expectation  E(?/j|Cj)  =  fiy  c'-ol 
and  variance 

y&T{yj\Cj)  =  ^((1  -pi)cTj-o-j,c'S"Vyc)  =  (14) 

where  is  defined  to  be  ((1  -  pi)(T^  -  a-yc'E'^ (Xyc)  (see  Theorem  2.5.1  of  Anderson 
1984).  As  with  the  case  of  the  linear  relationship  in  (2),  the  h  x  h  vector  of  mean  paired 
responses,  y,  can  be  represented  as; 


yi 

y2 

yh 


=  Myl(/ixi)  +  Ca  -I-  e*. 


(15) 


where  C  is  a  /i  x  s  matrix  of  control  variates  whose  jth  row  is  c' .  Regression  analysis  on 
this  linear  model  yields  the  following  controlled  estimator  of  the  mean  response; 


1 


1 


h  =  jl'(ixM(y  -  C(C'QC)-'C'qy)  =  jl'„x/,)(I(Ax/.|  -  C(C'QC)-'C'Q)y,  (16) 

where  Q  =  I(/ix/i)  —  I'his  expression  for  fiy,  we  can  easily  show  (see 

Section  5.2.4)  that  its  unconditional  variance  is 


Var(^,)  = 


h  + 


hs 


{h-s-2)\ 


V  ^yc  I  I  I 


h-2 
h  —  s  —  2 


(17) 
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where  i?yc  is  the  multiple  correlation  coefficient  between  j/,-  and  c,  (i  —  1 , 2, . . . ,  2/i). 
Thus,  provided  the  assumption  that  0  <  pi  <  1  holds,  Combined  Method  I  will  result  in 
a  reduction  of  the  variance  of  Var(/ij,),  if  the  effects  due  to  antithetic  variates  (pi)  and 
the  control  variates  (/?yc)^,  together,  compensate  for  the  loss  factor 


5.2.2  Combined  Method  II 

This  section  considers  the  second  method  for  combining  control  variates  and  antithetic 
variates,  which  is  based  on  correlated  replications  utilizing  only  just  the  control  variates. 
Contrary  to  the  random  number  assignment  in  the  previous  section,  this  method  uses 
antithetic  variates  for  the  control-variate  random  components  and  independent  random 
number  streams  for  all  other  random  components  in  the  model.  With  this  replication 
strategy,  we  induce  negative  correlations  between  the  responses,  between  the  control 
variates,  and  between  the  response  and  the  control  variates  within  h  pairs  of  the  responses 
and  the  control  variates,  respectively,  obtained  from  (R2j_i,i,  R2;-i,2)  ^.nd  (R2;,i,  R2j-i,2) 
(j  =  1,2,  ...,/i).  However,  across  pairs  of  replications,  we  get  independent  outputs. 
Based  on  the  above  discussion  and  the  development  given  for  Combined  Method  I,  we 
note  the  following  properties  {i  =  1,2,...,  2h): 

Property  1:  homogeneity  of  response  variances  across  replicates, 

var(y, )  =  cr^  f  =  1, 2, . . . ,  2/i.  (18) 

Property  2:  homogeneity  of  negative  correlations  across  replicate  pairs  and  indepen¬ 
dence  of  responses  between  replicate  pairs, 

cov{yk,yt)  = for  /^  =  /  +  1  (/ =  1, 3, . . . ,  2/i  -  1 ); 
cov{yk,yi)  =  0,  otherwise. 
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Property  3;  homogeneity  of  response-control  variates  covariances  across  replicates, 
homogeneity  of  response-control  variates  covariances  across  replicate  pairs,  and 
independence  of  response-control  variates  covariances  between  replicate  pairs, 

Cov(2/,,c,)  =  CTyc  ^  =  1,2, . . .  ,2/i; 

Cov(yfc,  c/)  =  £r[,c  for  k  =  l+l  (/ =  1, 3, . . . ,  2/i  -  1);  '  (20) 

Cov(y,,Cj)  =  0(^x1),  otherwise. 

Property  4:  homogeneity  of  control  variates  covariances  across  replicates,  homogeneity 
of  control  variate  covariances  across  replicate  pairs,  and  independence  of  control 
variates  between  replicate  pairs, 

\ 

Cov(ci)  =  Sc  i  =  1,2, . . .  ,2/i; 

Cov(c,,c,)  =  S('^  for  A:  =  /-f  1  (/  =  l,3,...,2/i-l);  >  (21) 

Cov(cfc,Cj)  =  0(,xs),  (otherwise). 

We  make  the  following  assumption 

Assumption  :  0  <  P2  <  1- 

Under  the  four  properties  given  above,  the  variance  of  the  covariance  of  Cj  and  the 
covariances  between  yj  and  Cj  are,  respectively  ( j  =  1,2, ... ,  h): 

var(yj  )  =  ^(1  -  P2)o-J,  (22) 

Cov(cj)  =  -(Cov(C2_;-l)  -|-Cov(C2j)  +  2Cov(c2j-1,  C2j))  =  (2^) 

and 

Cov(yj,Cj)  =  -Cov(y2j-l  +  y2j,C2j-l  +  C2j) 

=  ^(Cov(y2j-l,C2j_i)  +  Cov(j/2j_1,C2j)  +  Cov(y2j ,  C2j-1 )  +  Cov(y2j,  C2j)) 

=  i(<^vc  +  0-ic)- 

(24) 
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Based  on  the  joint  normality  assumption  of  the  response  and  control  variates  and  the 
properties  given  in  (18),  (19),  (20)  and  (21),  we  have  the  joint  distribution  of  and 
is  given  as: 


V: 


N 


s  +  l 


f^y 

VL 


1 

’2 


{1-P2)al  o-yc'  +  o- 


/(2) 

yc 


'  yc 


+  o-S  Sc  + 


'yc 


(25) 


Since  Cj  {j  =  l,2,...,/i)  are  independent,  following  the  development  of  (17)  we  get  by 
an  analogous  approach: 


Var(,i,)  =  ^ 


{h-s-2)^ 


2h 


_  9 


(26) 


wjiere  is  defined  to  be 


r|  =  (1  -  -  (<r,e  +  <rP>)'(Ec  +  Sra)-‘(T,c  +  <rg)  (27) 

and 

=  ((1  -  (28) 

which  is  the  square  of  the  multiple  correlation  coefficient  between  and  Cj.  Thus, 
provided  the  assumption  0  <  P2  <  f  holds.  Combined  Method  II  will  result  in  a  reduction 
of  the  variance  of  Var(/iy),  if  the  effects  due  to  antithetic  variates  (^2)  s^nd  the  control 
variates  (/2yc  )^,  together,  compensate  for  the  loss  factor 


5.2.3  Combined  Method  III 

This  section  presents  Combined  Method  III  which  jointly  implements  control  variates 
and  antithetic  variates  based  on  correlated  replicates  induced  by  utilizing  all  random 
components  in  the  simulation  model.  Unlike  the  random  number  assignment  strategies 
discussed  in  the  previous  two  sections,  we  apply  antithetic  variates  to  all  random  com¬ 
ponents  in  the  model.  This  assignment  strategy  induces  correlations  across  h  pairs  of 


12-30 


both  the  response  and  the  control  variates.  That  is,  negative  correlations  are  induced 
between  the  reponses,  between  the  control  variates,  and  between  the  response  and  the 
control  variates  within  h  pairs  of  the  replicates.  However,  the  induced  correlation  be¬ 
tween  the  response  and  the  control  variates  is  different  from  that  of  Combined  Methods 
I  and  II.  Across  the  h  pairs  of  replicates,  the  mean  reponse  and  the  mean  of  the  control 
variates  (within  a  pair  of  replicates)  are  independently  observed  by  the  assignment  of 
different  sets  of  randomly  chosen  random  number  streams.  As  before,  we  identify  a  set 
of  properties  for  this  strategy  with  regard  to  the  covariance  structure  of  the  responses 
and  control  variates  in  the  experiment.  Based  on  the  above  discussion  and  the  develop¬ 
ments  for  the  other  two  combined  methods,  for  Combined  Method  III  we  establish  that 
(^  =  l,2,...,2/i): 

Property  1:  homogeneity  of  response  variances  across  replicates, 

var(y,)  =  cr2  f  =  1, 2, . . . ,  2/i.  (29) 

Property  2:  homogeneity  of  negative  correlations  across  replicate  pairs  and  indepen¬ 
dence  of  responses  between  replicate  pairs, 

cov{yk,yi)  =  for  k  =  l+  \  (/  =  1 , 3, . . . ,  2/i  -  1 ); 

cov(yj,  y*:)  =  0,  (otherwise). 

Property  3:  homogeneity  of  response-control  variates  covariances  across  replicates, 
homogeneity  of  response-control  variates  covariances  across  replicate  pairs,  and 
independence  of  response- control  variates  covariances  between  replicate  pairs, 

Cov(y,',c,)  =  o-yc  f  =  1,2, . . .  ,2/i; 

Cov(i/fc,  c;)  =  <7yc  for  k  =  l+l  (/ =  1,3, . . .  ,2/i  -  1);  ’  (31) 

Cov(y.,Cj)  =  0(,xi),  otherwise. 
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Property  4:  homogeneity  of  control  variates  covariances  across  replicates,  homogeneity 
of  control  variate  covariances  across  replicate  pairs,  and  independence  of  control 
variates  between  replicate  pairs, 


Cov(c,)  =  Sc  i  =  1,2,...,2; 

Cov(cfc,c,)  =  S(f)  for  A:  =  /+l  (/  =  l,3,...,2/i-l);  - 
Cov(c,,Ck)  =  O(sxs),  otherwise. 


(32) 


We  make  the  following  assumption 


Assumption  :  0  <  ps  <  1. 


Under  the  four  properties  given  above  we  obtain  analogous  results  to  those  given  in 
Section  5.2.2  (j  =  1, 2, . . . ,  /i); 

var(yj)  =  ^(1 -P3)<7j  (33) 

Cov(Cj)  =  -(Cov(c2j_l)  +  Cov(c2j)  +  2Cov(c2j_l,  C2j))  =  2^^*^ 

and 

Cov(yj,Cj)  =  -Cov(y22-l  +I/2j,C2j-l  +  C2j)  -  -(cTyC  +  ^-yc)-  (35) 

Under  the  joint  normality  assumption  of  the  response  and  control  variates  and  the  prop¬ 
erties  given  in  (29),  (30),  (31)  and  (32),  the  joint  distribution  of  y^  and  is  given  as: 


2/j 

c, 


N 


s  +  l 


0 


(sxl) 


I 

’  9 


(1-P3)^y  O-j^c'  +  O- 


+  <T 


d3) 
yc 

Sc  +  s<") 


\ 

jJ 


(36) 


'  t/C  “T  yC 

Since  Cj  {j  =  1,2,  ...,/i)  are  independent,  following  the  development  of  (26)  we  get  by 
an  analogous  approach: 


Var(/iy) 


(1  -p3)o-; 


2h  \h  —  s  —  2 , 


2h 


''(i-(flef) 


'  ft  -  2 
h  —  s  —  2  , 


(37) 
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(38) 


where  r|  is  defined  to  be 

=  (1  -  p,]„l  -  (<r„  +  <TP')'(Se  +  EP’j-'lo'.c  +  -rj?) 

and 

(/ig')“  =  ((1  -  Pzylf'iT,,  +  <7-CT)'(Sc  +  SP')-'(o-,o  +  <rP'),  (39) 

which  is  the  square  of  the  multiple  correlation  coefficient  between  and  Cj.  Provided 
the  assumption  0  <  p3  <  1  holds,  Combined  Method  III  will  result  in  a  reduction  of  the 
variance  of  Var(/ty),  if  the  effects  due  to  antithetic  variates  (ps)  and  the  control  variates 
together,  compensate  for  the  loss  factor 

5.2.4  Comparison  of  the  Combined  Methods  and  Control  Vari¬ 
ates  Method 

In  this  section  we  give  a  brief  and  formal  comparison  of  the  three  combined  methods 
presented  in  the  three  previous  sections  and  the  method  of  control  variates.  This  com¬ 
parison  assumes  the  validity  of  the  assumptions  given  for  each  of  these  four  methods 
and  is  done  with  respect  to  the  unconditional  variances  of  the  estimators  for  the  mean 
response  given  in  equations  (5),  (17),  (26),  and  (37),  respectively.  (We  can  say  something 
in  general  about  the  assumptions.  If  the  simulation  response  y  and  the  simulation  model 
are  structured  so  that  y  is  monotonic  in  each  random-number  input  (either  nonincreas¬ 
ing  or  nondecreasing),  then  it  is  quaranteed  that  pi,  p2,  and  p^  are  all  nonnegative  (see 
Brately,  Fox,  and  Schrage  1987). 

First,  we  consider  the  three  combined  methods  presented  earlier.  Comparing  Com¬ 
bined  Methods  I  and  II  via  equations  (17)  and  (26)  yields  that  Combined  Method  I  is 
preferred  to  Combined  Method  II  if 
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Similarly,  working  with  (17)  and  (37)  yields  that  Combined  Method  III  is  better  than 
Combined  Method  I,  provided  that 

Also,  working  with  (26)  and  (37)  yields  that  Combined  Method  III  is  better  than  Com- 
bined  Method  II  if 


(1  -  /)3)(1  -  (/?£*)')  <  (1  -  /’2)(1  -  (7?S)')-  (42) 


As  we  discussed  earlier,  the  loss  factors  for  the  three  combined  methods  are  the  same; 
hence,  they  cancel  when  constructing  the  comparisons.  Thus,  the  preference  of  the  three 
methods  is  determined  according  to  their  minimum  variance  ratios  given  in  (17),  (26), 
and  (37),  respectively.  Of  course,  other  ordering  schemes  for  the  terms  in  equations 
(17),  (26),  and  (37)  could  be  conjectured.  Clearly,  it  is  not  easy  to  identify  an  ordered 
relationship  among  (/?S)^  (4c  )^  and  (4c  )^  since  these  terms  involve  the  unknown 

elements  Sc,  cTyc,  <Tyc\  pii  and  p^. 

Next,  we  compare  these  three  combined  methods  to  the  method  of  control  variates. 
A  comparison  of  equations  (5)  and  (17)  yields  that  Combined  Method  1  is  better  than 
the  control  variates  method  if 


(1-^, -(fli?) 


'  h-2 
h-s-2 


-7  <(i-4c) 


'  2h-2  ' 
2h-  s-2 


(43) 


Also,  comparing  equations  (5),  (26),  and  (37)  shows  that  Combined  Method  II  is  better 
than  the  method  of  control  variates  if 


(l-p2)(l-(4c)') 


h-2 


<(i-4c) 


2h  -  2 


\  2h- s-2 


(44) 


h-s-2, 

\  y  •  • 

and  Combined  Method  III  yields  a  better  result  than  the  method  of  control  variates, 
provided 


(1-P3)(1-(M?)') 


h-2 
h  —  s  —  2 , 


<(i-4c) 


2h-2 


\2h-s-2 


(45) 
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Note  that  the  loss  factor  of  each  combined  method  is  greater  than  that  of  the  method 
of  control  variates.  Hence,  for  preference  of  each  combined  method  to  the  method  of 
control  variates,  the  associated  minimum  variance  ratio  of  the  combined  method  should, 
at  least,  compensate  for  an  increase  in  the  associated  loss  factor.  The  effects  of  antithetic 
variates  and  control  variates  on  the  minimum  variance  ratio  for  Combined  Method  I  are 
represented  by  an  additive  form  in  reducing  the  variance  of  the  estimator  of  the  mean 
response. 

5.2.5  Proof  of  Equation  (17) 

Given  C,  the  conditional  variance  of  fiy  in  (16)  is: 

var(ii,|C)  =  pl'„xM(VxM-C(C'qC)-'C'Q)Cov(y|C)(I„,,|-QC(C'C!C)-'C')ln.i|. 

(46) 

Since  {yj,Cj)  is  obtained  from  the  jth  pair  of  replicates  and  the  pairs  of  replicates  are 
independent,  we  have  from  equation  (14)  that: 

Var(ylC)  =  -[(1  —  pi)ay  —  (ryc’S^  ^ycWhxh)  —  (47) 

Subsituting  for  Var(y|C)  into  (47)  gives: 

var(/i,|C)  =  ^{h  +  r(ix/.)C(C'QC)->C'l(,,i)),  (48) 

since  Ql(hxi)  =  l'(ix/i)Q  =  0(/ixi)  and  =  Q.  From  the  result  given  in  (13)  and  the 
last  assumption  in  (9),  the  {h  x  s)  random  matrix  C  has  the  matrix  normal  distribution: 

C  ~  N/,,s(0(;,X5),I(/ix/i),  2^c), 

where  0(/ixs)  is  a  (h  x  s)  matrix  of  zeroes  (see  Section  17.2  in  Arnold  1981).  From  this 
result  we  have  (see  p.  454  and  Theorem  17.2(d)  of  Arnold  1981): 

Z  =  (-Sc)“2C'  ~  Ns,/i(0(3xA),I(sxs)5l(/ix/i))-  (49) 
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Now,  it  follows  immediately  that 


Z'Z  =  C(^Sc)-'C'  ~  \Wkis,l^kxk)),  (50) 

using  the  definition  of  the  Wishart  distribution  (see  equation  (17.15)  of  Arnold  1981)  and, 
by  Theorem  17.7a  in  Arnold  (1981),  the  s  x  s  random  matrix  (C  QC)  has  the  following 
Wishart  distribution: 

(C'QC)~W,(/i-l,^Sc),  (51) 

since  Q  is  an  idempotent  matrix  with  rank  {h  —  l).  We  note  that  (l'(ix/i)C)  and  (C'QC) 
are  independent  (see  Theorem  17.7b(2)  of  .A.rnold  1981). 

Now,  by  equations  (15)  and  (16),  we  have  that 

E(A,|C)  =  tl'|.x*)[I(»x»,-C(C'QC)-‘C'Q]E(j/]C] 

=  -  C(C'QC)-'C'Ql(,.,l(,xi)  +  Ca). 

n 

(52) 

which,  in  turn,  by  multiplying  through  all  terms  and  using  the  result  Ql(/ixi)  =  0(/ixi) 
given  earlier,  yields 

E(/i,|C)  =  (53) 

From  this,  we  get 

var[E(/ij,|C)]  =  var(/iy)  ==  0.  (54) 

Thus,  the  unconditional  variance  of  fiy  can  be  written  as: 

var(/ly)  =  var[E(/ly|C)]  +  E[var(/iy|C)]  =  0  +  2^2  l(Axi)]) 

(55) 

since  I'C  and  C'QC  are  independent.  Theorems  17.6a  and  17.15d  in  Arnold  (1981)  give, 
respectively, 

E(C(lEe)-’C']=U,*xM.  and  EKC'QC)-']  =  (^  _  ^  _  J  S-'  \f  k>s  +  2.  (56) 
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Therefore,  plugging  the  second  equation  in  (57)  into  (56)  finally  yields 


Var(/iJ 


iL  \h  2E[1Vx/.)CSc'C1  (/ix  1). 

^  ^  {h-s-2) 


(57) 


which  further  reduces  to  equation  (17),  by  equation  (51)  and  Theorem  17.6(a)  of  Arnold 
(1981).  The  result  in  (17)  indicates  that  the  minimum  variance  ratio  of  this  method  is 


(1  —  pi  —  (i7yc^)^)  and  the  loss  factor  is 


6.  Conclusions 

In  this  study  we  have  described  a  valid  computer  simulation  model  of  the  XP  micro¬ 
computer  network  that  is  to  become  operational.  We  have  also  indicated  how  this  sim¬ 
ulation  model  can  be  used  as  an  experimental  device  to  create  data  that  can  be  used  to 
fit  regression  models  between  responses  and  input  factors  of  interest  and  how  these  re¬ 
gression  models  can  be  used  to  help  design  the  network.  We  have  also  developed  a  set  of 
variance  reduction  methodologies  that  greatly  improves  the  estimation  of  the  regression 
model  coefficient  estimators. 

In  the  past,  both  control  variates  and  antithetic  variates  have  been  shown  to  often  be 
effective  variance  reduction  techniques  for  estimating  the  mean  of  a  response  of  interest 
for  simulation  experiments.  In  Section  5  we  have  presented  three  methods  that  can  easily 
be  implemented  in  a  simulation  experiment  without  significant  additional  programming 
effort;  each  combines  both  control  variates  and  antithetic  variates.  Under  general  as¬ 
sumptions,  these  combined  methods  were  shown  to  yield  superior  performance,  based  on 
estimator  variance,  than  control  variates  alone. 
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APPENDIX  I 


SLAM  II  CODE  FOR  SIMULATION  MODEL 
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PROGRAM  MAIN 

COMMON/ SCCM1/ATRI3 (100) , DD ( 100) , DDL ( 100) , DTNOW , II , MFA, 
*MSTOP  ,  NCLNR,  NCRDR,  MPRNT^  NNRUN  ,  NNSET  ,  NTAPE  ,35(100)  , 
*SSL(100) ,TNEXT,TNOW,XX(100) 

COMMON/UCOM1/FX(27) ,XVAL(27) 

COMMON  QSET(200000) 

DIMENSION  NSET (200000) 

EQUIVALENCE  (NSET ( 1) , QSET ( 1) ) 

NNSET=200000 

NCRDR=5 

NPRNT=6 

NTAPE=7 

CALL  SLAM 

STOP 

END 

SUBROUTINE  INTLC 

COMMON/SCOM1/ATRIB(100) ,DD(100) ,DDL(100) , DTNOW, II, MFA, 
*MSTOP , NCLNR , NCRDR , NPRNT , NNRUN , NNSET , NTAPE , SS ( 10  0 )  , 
*SSL(100) ,TNEXT,TNOW,XX(100) 

COMMON/UCOM1/FX(27) ,XVAL(27) 

C********** ************************************************* 
C*  INITITALI2E  THE  CUMULATIVE  DISTRIBUTION  FUNCTION  * 

C*  FX  AND  ITS  ASSOCIATED  VALUES  XVAL  THAT  WILL  DETERMINE* 

C*  THE  ROUTING  SCHEME  OF  INCOMING  MESSAGES  * 

C********* ************************************************** 


FX(1)= 

.0370 

FX(2)= 

.0741 

FX(3)= 

.1111 

FX(4)= 

.1481 

FX(5)= 

.1852 

FX(6)= 

.2222 

FX(7)= 

.2593 

FX(3)= 

.2963 

FX ( 9 )  = 

.3333 

FX(10)= 

.3704 

FX(11)= 

.4074 

FX(12)= 

.4444 

FX(13)= 

.4815 

FX(14)= 

.5185 

FX(15)= 

.5556 

FX(16)= 

.5926 

FX(17)= 

.6296 

FX(18)= 

.  6667 

FX(19)= 

.7037 

FX(20)= 

.7407 

FX(21)= 

.7778 

FX(22)= 

.8148 

FX(23)= 

.8519 

FX(24)= 

.8889 

FX(25)= 

.9259 

FX(26)= 

.9630 

FX(27)= 

1.000 

XVAL(l) 

=  1 

XVAL  (2) 

=2 

XVAL(3) 

=  3 

XVAL(4) 

=4 

XVAL  (5) 

=5 

XVAL(6) 

=6 

y'7AL(7) 

=7 
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XVAL(3)=8 

X’/AL{9)=9 

XVAL(10)=10 

XVAL(11)=11 

XVAL(12)=12 

XVAL(13)=13 

OTAL(14)=14 

XVAL(1S)=15 

XVAL(16)=16 

XVAL(17)=17 

XVAL(ia)=13 

XVAL(19)=19 

XVAL(20)=20 

XVAL(21)=21 

XVAL(22)=22 

XVAL(23)=23 

X’/AL(24)=24 

XVAL(25)=25 

XVAL(26)=26 

XVAL(27)=27 

RETURN 

END 

FUNCTION  USERF(I) 

COMKON/SCOM1/ATRIB(100) ,DD(100) ,DDL(100) , DTNOW, II ,MFA, 
*MSTOP , NCLNR, NCRDR, NPRNT, NNRUN, NNSET, NTAPE , SS ( 100 ) , 
*SSL(100) ,TNEXT,TNOW,XX(100) 

C0MM0N/UC0M1/FX(27) ,XVAL(27) 

US £RF=DPROB ( FX , XVAl , 2 7 , 1 ) 

RETURN 

END 

//GO.SYSIN  DD  * 

GEN, TEW, ROME  NETWORK, 10/6/92, 1; 

UMITS,29,7,1S000,• 

NETWORK ; 

******************************************************* 

;  THIS  IS  THE  SLAM  II  NETWORK  CODE  FOR  THE  ROME  * 

LABORATORY  MACINTOSH  ETHERNET  NETWORK  * 

* 

;  CREATE  THE  MESSAGES  THAT  WILL  FLOW  THROUGH  THE  * 

;  NETWORK  AND  INDICATE  IN  ATTRIBUTE  (2)  WHERE  THAT* 

;  MESSAGE  CAME  FROM  * 

* 

•  ALSO,  LET  ATRIB(4)=1  IF  MESSAGE  IS  ROUTED  TO  * 

;  ANOTHER  NODE  AND  LET  ATRIB(4)=2  IF  MESSAGE  IS  * 

;  IS  TO  GO  TO  THE  STAR  CONTROLLER  * 

* 

;  ATRIB(l)  IS  THE  ARRIVAL  TIME  OF  MESSAGE  * 

;  ATRIB(2)  IS  THE  ORIGINATION  NODE  OF  MESSAGE  * 

;  ATRIBP)  IS  THE  DESTINATION  NODE  OF  MESSAGE  * 

;  ATRI3(4)  INDICATES  IF  MESSAGE  RETURNS  OR  NOT  * 

;  ATRI3(5)  IS  THE  LINE  RESOURCE  NUMBER  * 

;  ATRI3(6)  IS  THE  NUMBER  OF  FRAMES  IN  A  MESSAGE  * 

ATRI3(7)  IS  THE  NUMBER  OF  BYTES  IN  THE  MESSAGE  * 
•*******★★*★***********'*********’**'********************* 


•  Hitit  it*  **************************************  it  ********** 

•  DEFINE  RESOURCE  ELEMENTS  TO  EACH  LINE  IN  THE  * 

;  NETWORK  AS  WELL  AS  THE  STAR  CONTROLLER  * 

■  ****************************************************** 
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resource/ LINEl { 1) , 29 , 15 , 17 , 18 , 19 ; 

RESOURCE/LINE2 (1),29,13; 

RESOURCE / LINE3 (1) ,29,14,16; 

RESOURCE/LINE4(l) ,29,11,12; 

RESOURCE/LINE5(l) ,29,3,9,10; 

RESOORCE/LINE6(l) ,29,7; 

RESQURCE/LINE7(1) ,29,4,5,6; 
resource/lines (1) ,29,1,2,3; 

RES0URCE/LINE9(1) ,29,20; 

RESOURCE/LINE10(1) ,29,22; 

RESOURCE/LINEll(l) ,29,23; 

RESOURCE/LINE12 (1) ,29,25,26; 

RESOURCE/ LINE13 (1) ,29,21; 

RESOURCE/LINE14 (1) ,29,24; 

RES0URCE/LINE15(1) ,29,27; 

RESOURCE/STAR (4) ,28; 

•  iiit  i,  it  *  ii  *  a*  it  **  *it  *  it  *  it  ***********  it -k-k  ******  Hit*  ******  ******* 

;  CORRIDOR  0  (CREATE  MESSAGES  AT  EACH  OF  THE  NODES  * 
;  IN  CORRIDOR  O)  * 

;***********»************»***************************** 


CREATE, EXP(5) , 0.0,1, ,1; 

ASSIGN , ATRIB ( 2 ) =1 , ATRIB ( 5 ) =8 , 

ATRIB ( 6 ) =NPSSN ( 11 , 2 ) , ATRIB (7) =1526 .  ,1; 
AWAIT (1) ,LINE8/l; 

ACT, .00003, , SC; 

CREATE, EXP(5) ,0.0,1,,1; 
ASSIGN,ATRIB(2)=2,ATRIB(5)=8, 

ATRIB(6)=NPSSN(11,2) , ATRIB (7) =1526 .  ,1; 
AWAIT (2) , LINES/ l; 

ACT,.00003,,SC,• 


CREATE, EXP(5) ,0.0,1, ,1; 

ASSIGN ,  ATRIB  ( 2 )  =3  ,  ATRIB  ( 5 )  =8  , 

ATRIB(6)=NPSSN(11,2)  , ATRIB (7) =1526  .,  1  ; 
AWAIT (3) , LINES/ 1; 

ACT, .00003, ,sc; 

CREATE, EXP (5) ,0.0,1, ,1; 

ASSIGN , ATRIB ( 2 ) =4 , ATRIB ( 5 ) =7 , 

ATRIB{6)=NPSSN{11,2) , ATRIB (7 ) =1526  .  ,1; 
AWAIT (4) ,LINE7/l; 

ACT, .00003, ,SC; 

CREATE, EXP (5) ,0.0,1, ,1; 

ASSIGN , ATRIB ( 2 ) =5 , ATRIB ( 5 ) =7  , 

ATRIB(6)=NPSSN(11,2) , ATRIB (7) =1526  .,  1  ; 
AWAIT (5) ,LINE7/l; 

ACT, .00003, ,SC; 

CREATE, EXP(5) ,0.0,1, ,1; 

ASSIGN , ATRIB ( 2 ) =6 , ATRIB ( 5 ) =7 , 

ATRIB(6)=MPSSN(11,2)  ,  ATRIB (7 ) =1526 .,  1  ; 
AWAIT (6) ,LINE7/l; 

ACT, .00003, ,SC; 

CREATE, EXP(5) ,0.0,1, ,1; 
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ASSIGN , ATRIB ( 2 ) =7 , ATRIB ( 5 ) =6 , 

ATRIB(6) =NPSSN(11, 2) , ATRIB (7) =1526 . ,1 
AWAIT (7) ,LINE6/1; 

ACT, .00003, ,SC; 

CREATE, EXP(5) ,0.0,1, ,1; 

.ASSIGN ,  ATRIB  ( 2 )  =8  ,  ATRIB  ( 5 )  =5  , 

ATRIB(6)=NPSSN(11,2) , ATRIB (7) =1526 . ,1 
AWAIT (3) ,LINE5/l; 

ACT, .00003, .SC; 

CREATE, EXP(5) ,0.0,1, ,1; 

ASSIGN , ATRIB ( 2 ) =9 , ATRIB ( 5 ) =5 , 

ATRIB(6)=NPSSN(11,2) , ATRIB (7 ) =1526 .  ,1 
AWAIT (9) , LINES /I, • 
act, .00003, ,SC; 


CREATE, EXP(5) ,0.0,1, ,1; 

ASSIGN , ATRIB ( 2 ) =10 , ATRIB ( 5 ) =5 , 

ATRIB(6)=NPSSN(11,2)  , ATRIB (7) =1526  .  ,1 
AWAIT (10) , LINES /I; 

ACT, .00003, ,SC; 

CREATE, EXP(5) ,0.0,1,,1; 

ASSIGN, ATRIB (2) =11, ATRIB (5) =4, 

ATRIB(6)=NPSSN(11,2) , ATRIB (7 ) =152 6 . ,1 
AWAIT(ll) ,LINE4/l; 

ACT, .00003, ,SC; 


CREATE, EXP(5) ,0.0,1, ,1; 

ASSIGN, ATRIB ( 2 ) =12 , ATRIB ( 5 ) =4 , 

ATRIB(6)=NPSSN(11, 2) , ATRIB (7) =1526 . ,1 
AWAIT (12) ,LINE4/l; 

ACT, .00003, , SC; 


CREATE, EXP(5) ,0.0,1, ,1; 

ASSIGN, ATRIB ( 2 ) =13 , ATRIB ( 5 ) =2 , 

ATRIB(6)=NPSSN(11,2)  , ATRIB ( 7 ) =152 6 .  ,  1 
AWAIT (13) ,LINE2/l; 

ACT, .00003, ,SC; 


CREATE, EXP(5) ,0.0,1,,!; 

ASSIGN , ATRIB ( 2 ) =14 , ATRIB ( 5 ) =3 , 

ATRIB(6)=NPSSN(11,2) , ATRIB (7) =1526 .,  1 
AWAIT (14) ,LINE3/l; 

ACT, .00003, ,SC; 

CREATE, EXP(5) ,0.0,1, ,1; 

ASSIGN , ATRIB ( 2 ) =15 , ATRIB ( 5 ) =1 , 

ATRIB(6)=NPSSN(11,2)  , ATRIB (7 ) =1526 1 
AWAIT (15) ,LINE1/1; 

ACT, .00003, ,sc; 

CREATE, EXP (5) ,0. 0, 1, ,1; 

ASSIGN ,  ATRIB  ( 2 )  =16 ,  ATRIB  ( 5 )  =3  , 

ATRIB(6)=NPSSN(11,2) , ATRIB (7 ) =1526 ., 1 
AWAIT (16) ,LINE3/1; 

ACT, .00003, ,SC; 

CREATE, EXP(5) ,0.0,1, ,1; 
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ASSIGN , ATRIB ( 2 ) =17 , ATRIB ( 5 ) = 1 , 

ATRIB(5) =NPSSN(11, 2) , ATRIB (7) =1526 . ,1; 

AWAIT (17) ,LINE1/1; 

ACT, .00003, ,SC; 

CREATE, EXP(5) ,0.0,1, ,1; 

ASSIGN  ,  .ATRIB  ( 2 )  =18  ,  ATRIB  ( 5 )  =1 , 

.ATRI3(6)=NPSSN(11,2)  , ATRIB ( 7)  =1526 .,  1  ; 

AWAIT (18) ,LINE1/1; 

ACT, .00003, , SC; 

CREATE, EXP(5) ,0.0,1, ,1; 

ASSIGN, ATRIB(2) =19, ATRIB ( 5) =1, 

ATRIB(6)=NPSSN(11,2) , ATRIB (7) =1526 ., 1 ; 

AWAIT (19) ,LINE1/1; 

ACT, .00003, , sc; 

■k -k  it* -k*  it-k  ********************  ****‘^t***  ******  ************* 

CORRIDOR  A  (CREATE  MESSAGE  AT  EACH  OF  THE  NODES  * 

IN  CORRIDOR  A)  * 

******************************************************* 


CREATE, EXP(5) ,0.0,1,,1; 

ASSIGN, ATRIB (2) =20, ATRIB (5) =9, 

ATRIB(6)=NPSSN(11,2) , ATRIB (7) =1526 ., 1; 

AWAIT (20) ,LINE9/l; 

ACT, .00003, ,SC; 

******************************************************* 
CORRIDOR  B  (CREATE  MESSAGES  AT  EACH  OF  THE  NODES  * 
IN  CORRIDOR  B)  * 

******************************************************* 


CREATE, EXP(5) ,0.0,1,,!; 

ASSIGN, ATRIB (2) =21, ATRIB (5) =13, 

ATRIB(6)=NPSSN(11,2) , ATRIB (7 ) =1526 1 ; 
AWAIT (21) ,LINE13/l; 

ACT, .00003, , SC; 

CREATE, EXP (5) ,0.0,1,,1; 

ASSIGN, ATRIB (2) =22, ATRIB (5) =10, 

ATRIB(6)=NPSSN(11,2) , ATRIB (7) =1526 . ,1; 
AWAIT (22) ,LINE10/1; 

ACT, .00003, , SC; 

CREATE, EXP (5) ,0.0,1,,1; 

ASSIGN, ATRIB (2) =23, ATRIB (5) =11, 

ATRIB(6)=NPSSN{11,2) , ATRIB (7) =1526 ., 1 ; 
AWAIT(23) ,LINE11/1; 

ACT, .00003, , SC; 

CREATE, EXP (5) ,0.0,1, ,1; 

ASSIGN , ATRIB ( 2 ) =24 , ATRIB ( 5 ) =14 , 

ATRIB(6)=NPSSN{11,2) , ATRIB (7) =1526 ., 1 ; 
AWAIT (24) /LINE14/1; 

ACT, .00003, , SC; 

CREATE, EXP(5) ,0.0,1,,1; 

ASSIGN, ATRIB (2) =25, ATRIB (5) =12, 

ATRIB(6)=NPSSN(11,2) , ATRIB (7 ) =1526 . ,1; 
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AWAIT (25)  ,I,IWE12/l; 
ACT, .00003, ,SC; 


CREATE , EXP ( 5 ) , 0 . 0 , 1 , , 1 ; 

ASSIGN , ATRIB ( 2 ) =2  6 , ATRIB ( 5 ) =12 , 

ATRI3(6)=MPSSN(11,2) , ATRIB (7) =1526 . , 1 ; 
AWAIT (2 6) ,LINE12/1; 

ACT, .00003, ,SC; 

CREATE, EXP (5) , 0.0, 1, , 1; 

ASSIGN , ATRIB ( 2 ) =27 , ATRIB ( 5 ) =15 , 

ATRI3(6)=MPS3N(11,2) , ATRIB ( 7 ) =1526 ., 1 ; 

AWAIT (27) ,LINE15/1; 

ACT, .00003, ,SC; 

******************************************************** 

;  STAR  CONTROLLER  * 

•  * 

;  DETERMINE  WHERE  THE  MESSAGE  IS  GOING  TO  THEN  PUT  IT  * 
;  IN  THE  STAR  CONTROLLER  QUEUE  * 

;  ATRIB (3)  GIVES  THE  DESTINATION  OF  THE  MESSAGE  * 

•  * 

;  25  PERCENT  OF  MESSAGES  GO  BACK  TO  POINT  OF  * 

;  ORIGINATION  AND  75  PERCENT  GO  TO  ANOTHER  NODE  * 

•  * 

;************■*****************•******************»****** 

SC  ASSIGN, ATRIB(3)=USERF(1) , 

ATRIB(4)=l,l; 

ACT ,0.0, ATRIB ( 3 ) -ATRIB ( 2 ) . LE . 0 . 5 .  AND . 

ATRIB(2)-ATRIB(3) .LE.0.5,SC; 

ACT, 0.0; 


FREE,ATRI3(5)/l,l; 

ACT, 0.0, . 25, ASS; 

ACT, 0.0, .75,G01; 

ASS  ASSIGN, ATRIB (3) =ATRIB (2) , 
ATRIB(4)=2,1; 

ACT, 0.0, ,GO; 

GOl  GOON,l; 

ACT, 0.0, ATRIB (3) .LE.1.5,L8; 
ACT, 0.0, ATRIB (3) .LE.2.5,L8; 
ACT,0.0,ATRIB(3) .LE.3.5,L8; 
ACT, 0.0, ATRIB (3) .LE.4.5,L7; 
ACT, 0.0, ATRIB (3) .LE.5.5,L7; 
ACT ,0.0, ATRIB (3).LE.6.5,L7; 
ACT, 0.0, ATRIB (3) .LE.7.5,L6; 
ACT,0.0,ATRIB(3)  .LE.8.5,L5; 
ACT, 0.0, ATRIB (3) .LE.9.5,L5; 
ACT,0.0,ATRIB(3) .LE.10.5,L5; 
ACT, 0.0, ATRIB (3) .LE.11.5,L4; 
ACT,0.0,ATRIB(3) .LE.12.5,L4; 
ACT,0.0,ATRIB(3) .LE.13.5,L2; 
ACT, 0.0, ATRIB (3) .LE.14.5,L3; 
ACT, 0.0, ATRIB (3) .LE.15.5,L1; 
ACT, 0.0,ATRIB(3) .LE.16.5,L3; 
ACT, 0.0, ATRIB (3) .LE.17.5,L1; 
ACT, 0. 0,ATRIB(3) .LE.18.5,L1; 
ACT, 0.0, ATRIB (3) .LE.19.S,L1; 
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ACT,0.0,ATRI3(3) .LE.20.5,L9; 
ACT,0.0,ATRI3(3) .LE.21.5,L13; 
ACT,0.0,ATRI3(3) .LE.22.S,L10; 
ACT,0.0,ATRI3(3) .LE.23.5,L11; 
ACT,0.0,ATRIB(3) . LE. 24 . 5 , L14  ; 
ACT,0.0,ATRIB(3) .LE.25.5,L12; 
ACT,Q.0,ATRI3(3) .LE.26.5,L12; 
ACT,0.0,ATRIB(3) . LE. 27 . 5 , L15 ; 
LI  ASSIGN, ATRI3(5)=1,1; 

ACT ,  0 . 0  ,  ,  GO  ; 

L2  ASSIGN, ATRIB (5) =2,1; 

ACT ,  0 . 0  ,  ,  GO  ; 

L3  ASSIGN, ATRI3(5)=3,1; 

ACT , 0 . 0 ,  ,  GO  ; 

L4  ASSIGN, ATRIB (5) =4,1; 

ACT, 0.0, ,GO; 

L5  ASSIGN, ATRIB (5) =5,1; 

ACT ,  0 . 0  ,  ,  GO  ; 

L6  ASSIGN, ATRIB (5) =6,1; 

ACT ,  0 . 0  ,  ,  GO  ; 

L7  ASSIGN, ATRIB(5)=7,1; 

ACT ,  0 . 0  ,  ,  GO  ; 

L3  ASSIGN, ATRIB (5) =8,1; 

ACT, 0.0, ,GO; 

L9  ASSIGN, ATRIB (5) =9,1 ; 
ACT,0.0,,GO; 

LIO  ASSIGN, ATRIB (5) =10,1; 

ACT, 0.0, ,GO; 

Lll  ASSIGN, ATRIB (5) =11,1; 

ACT,0.0, ,GO; 

L12  ASSIGN, ATRIB(5)=12, l; 
ACT,0.0,,GO; 

L13  ASSIGN, ATRIB(S) =13,1; 

ACT,0.0, ,GO; 

L14  ASSIGN, ATRIB (5) =14,1; 

ACT, 0.0, ,GO; 

LIS  ASSIGN, ATRIB(5)=15,1; 

GO  GOON,l; 

ACT, 0.0; 


•  •***★**★******•**★**•****★**♦**♦  ****^** 

;  THE  STAR  CONTROLLER  * 

•  ititiHt  ******  Hit  ************************* 


AWAIT (28) ,STAR/X,1; 

ACT/l, . 00003*ATRIB(6) *ATRIB(7) ; 
FREE, STAR/ 1,1; 

AWAIT{29) ,ATRIB(5) /l,l; 
ACT,0.00003; 

GOON,  1; 

ACT, . 00003, ATRIB(3) .LE. 1.5, 0102 
ACT, . 00003, ATRIB(3) .LE. 2. 5, 0104 
ACT, . 00003, ATRIB(3) .LE. 3. 5, 0106 
ACT, . 00003, ATRIB(3) .LE. 4. 5, 0107 
ACT, . 00003, ATRIB(3) .LE. 5. 5, 0112 
ACT, . 00003 ,ATRIB(3) .LE. 6. 5, 0114 
ACT, .00003, ATRIB (3) .LE. 7. 5, 0116 
ACT, . 00003, ATRIB (3) .LE. 8. 5,0118 
ACT, . 00003, ATRIB(3) .LE. 9. 5, 0122 
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act  0Q003,ATRIB(3) .LE.10.5,0124; 
act',  .00003,ATRIB(3)  .  EE.  11 . 5 , 0129  ; 
ACT, . 00003, ATRIB(3) . LE. 12 . 5 , 013 0 ; 
ACT,  .  00003,  ATRIBH)  .  LE.  13 . 5 , 0137  ; 
ACT,  .  00003,  ATRIB^)  .LE.14.5,0138; 
act, .00003, ATRIB(3) . LE. 15 . 5 , 0139 ; 
ACT,  .  00003,  ATRIBH)  .  LE.  16 . 5 , 0140  ; 
act, .00003,ATRIB(3) . LE. 17 . 5 , 0141 ; 
ACT, . 00003, ATRIB(3) . LE. 13 . 5 , 0143 ; 
ACT, .00003,ATRIB(3) .LE.19.5,0144; 
act, .00003,ATRIB(3) .LE.20.5,A103; 
act, .00003, ATRIB(3) .LE . 21 . 5 , B109 ; 
ACT, . 00003, ATRIB(3) . LE. 22 . 5 , BllO ; 
act, .00003,ATRIB(3) . LE. 23 . 5 , B112 ; 
act, . 00003, ATRIB(3) .LE.24.5,B113; 
act,  .00003,  ATRIB(3)  .  LE.  25 . 5  ,  BUS  ; 
ACT, .00003 , ATRIB(3) . LE. 26 . 5 , B120 ; 
act, . 00003, ATRIB(3) .LE. 27. 5 , B119 ; 


.****************•************************************** 

■  COLLECT  STATISTICS  ON  EACH  MESSAGE  BEFORE  IT  LEAVES  * 

'i*I*L**™*^******************************************* 


0102  C0LCT,INT(1) ,0102; 
ACT, , ,TERM; 

0104  C0LCT,INT(1) ,0104; 
act, , ,TERM; 

0106  C0LCT,INT(1) ,0106; 
ACT, , ,TERM; 

0107  C0LCT,INT(1) ,0107; 
act, ,, TERM; 

0112  COLCT,INT(l) ,0112; 
act, , ,TERM; 

0114  C0LCT,INT(1) ,0114; 
ACT, , ,TERM; 

0116  C0LCT,INT(1) ,0116; 
act, , ,TERM; 

0118  C0LCT,INT(1) ,0118; 
act, , ,TERM; 

0122  C0LCT,INT(1) ,0122; 
ACT, , ,TEEM; 

0124  C0LCT,INT(1) ,0124; 
ACT, , ,TERM; 

0129  COLCT,INT(l) ,0129; 
ACT, , ,TERM; 

0130  C0LCT,INT(1) ,0130; 
act, , ,TERM; 

0137  C0LCT,INT(1) ,0137; 
ACT, , ,TERM; 

0138  COLCT,INT(l) ,0138; 
ACT, , ,TERM; 

0139  C0LCT,INT(1) ,0139; 
ACT, , ,TERM; 

0140  C0LCT,INT(1) ,0140; 
act, , ,TERM; 

0141  C0LCT,INT(1) ,0141; 
ACT, , ,TERM; 

0143  C0LCT,INT(1) ,0143; 
ACT,  ,  ,TEEM; 


12-46 


0144  C0LCT,INT(1) ,0144; 

ACT, , ,TEEM; 

A103  C0LCT,INT(1) ,A103; 

ACT, , ,TERM; 

3109  C0LCT,INT(1) ,B109; 

ACT, , ,TERM; 

BllO  C0LCT,INT(1) ,B110; 

ACT, , ,TESM; 

B112  C0LCT,INT(1) ,B112; 

ACT, , ,TERM; 

B113  C0LCT,INT(1) ,B113; 

ACT, , ,TERM; 

B116  C0LCT,INT(1) ,B116; 

ACT, , ,TERM; 

B119  C0LCT,INT(1) ,B119; 

ACT, , ,TERM; 

B120  COLCT, INT(l) ,B120; 

ACT, , ,TEEM; 

TEEM  FEEE,ATRIB(5)/1,1; 

COLCT, INT(l) , TOTAL  TIME; 
ASSIGN , XX ( 1 ) =ATRIB ( 6 ) *ATRIB ( 7 ) 
COLCT, XX ( 1) , TOTAL  BYTES; 
TEEMINATE; 

ENDNETWORK ; 

SEEDS, 79725(1) ,93872(2)  ; 

INIT,0,3600; 

SIMULATE; 

FIN; 
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APPENDIX  II 


EXAMPLE  OF  SLAM  II  SUMMARY  REPORT 
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SLAM  II 


ECHO  REPORT 


SIMULATION  PROJECT  ROME  NETWORK  BY  TEM 

DATE  10/  6/1992  RUN  NUMBER 

SLAM  II  VERSION  AUO  87 


GENERAL  OPTIONS 


PRINT  INPUT  STATEMENTS  (ILIST).  YES 
PRINT  ECHO  REPORT  (lECHO)t  YES 
EXECUTE  SIMULATIONS  (IXOT)i  YES 
WARN  OF  DESTROYED  ENTITIES:  YES 
PRINT  INTERMEDIATE  RESULTS  HEADING  (IPIRH)!  YES 
PRINT  SUMMARY  REPORT  (ISMRY):  YES 


LIMITS  ON  FILES 

MAXIMUM  NUMBER  OF  USER  FILES  (MFILS)<  29 
MAXIMUM  NUMBER  OF  USER  ATTRIBUTES  (MATR).  7 
MAXIMUM  NUMBER  OF  CONCURRENT  ENTRIES  (HNTRY)t  15000 


FILE  SUMMARY 


FILE 

INITIAL 

RANKING 

NUMBER 

ENTRIES 

CRITERION 

1 

0 

FIFO 

2 

0 

FIFO 

3 

0 

FIFO 

4 

0 

FIFO 

5 

0 

FIFO 

6 

0 

FIFO 

7 

0 

FIFO 

8 

0 

FIFO 

9 

0 

FIFO 

10 

0 

FIFO 

11 

0 

FIFO 

12 

0 

FIFO 

13 

0 

FIFO 

14 

0 

FIFO 

15 

0 

FIFO 

16 

0 

FIFO 

17 

0 

FIFO 

1  OF 
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18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 
29 


0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


FIFO 

FIFO 

FIFO 

FIFO 

FIFO 

FIFO 

FIFO 

FIFO 

FIFO 

FIFO 

FIFO 

FIFO 


HISTOGRAM  SPECIFICATIONS 
NCEL  HLOW  HWID 


RANDOM  NUMBER  STREAMS 

STREAM  SEED  REINITIALIZATION 

NUMBER  VALUE  OF  STREAM 
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1 

21825 

HO 

2 

19585 

HQ 

3 

1H5661099 

NO 

4 

1835732737 

HO 

5 

794161987 

NO 

6 

1329531353 

NO 

7 

200496737 

NO 

8 

633816299 

NO 

9 

1410143363 

NO 

10 

1282538739 

NO 

IHITIALIZATIOH  OPTIOHS 

BEGINNING  TIME  OF  SIMULATION  (TTBEG):  O.OOOOE+OO 


ENDING  TIME  OF  SIMULATION  (TTFIH).  O.3600E+0A 
STATISTICAL  ARRAYS  CLEARED  <JJCLR)i  YES 

VARIABLES  INITIALIZED  (JJVAR);  YES 

FILES  INITIALIZED  (JJFIDi  YES 


NSET/qSET  STORAGE  ALLOCATION 

DIMENSION  OF  NSET/OSET  (NNSET)i  200000 
HORDS  ALLOCATED  TO  FILING  SYSTEM!  165000 
WORDS  ALLOCATED  TO  VARIABLES:  15950 
WORDS  AVAILABLE  FOR  PL 0T3/TA BLES :  19050 


INPUT  ERRORS  DETECTED:  0 


EXECUTION  WILL  BE  ATTEMPTED 


XXIHTERMEDIATE  RESULTS** 


SLAM  I 

I  SUMMARY 

REPORT 

SIMULATION  PROJECT  ROME 

NETWORK 

BY  TEW 

DATE  10/  6/1992 

RUN  NUMBER 

CURRENT  TIME  0.3600E+OA 

STATISTICAL  ARRAYS  CLEARED  AT  TIME  O.OOOOE+OO 


**STATISTICS  FOR  VARIABLES  BASED  ON  OBSERVATION** 

MEAN  STANDARD  COEFF.  OF  MINIMUM  MAXIMUM 

VALUE  DEVIATION  VARIATION  VALUE  VALUE 


1  OF  1 


NUMBER  OF 
OBSERVATIONS 
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0102 

0101 

0106 

0107 

0112 

0111 

0116 

0118 

0122 

0121 

0129 

0130 

0137 

0138 

0139 

0110 

0111 

0113 

0111 

A103 

8109 

8110 
8112 
8113 
8116 

8119 

8120 

TOTAL  TIME 
TOTAL  BYTES 


a.1050E+03 
0.3898E+03 
0.1061E+03 
0  .  BOIlEt-OB 
0.3918E+03 
0 . 1283E+03 
0.1030Et03 
0 . 3987E+03 
0.3933E+03 
0.1057E+03 
0.3831ET03 
0.3933E+03 
0 . 1001E+03 
0.3525E+03 
a.3877E+03 
0.3896E+03 
0.3886E+03 
0 . 3911E+03 
0 . 3773E+a3 
0.3932E+03 
0.1033E+03 
0 . 3758E+03 
0.3837E+03 
0.1015E+03 
0.3371E+03 
0.1023E+03 
0.3830E+03 
a.3928E+03 
0.1681E+05 


0 .2050E+03 
0.222SEt03 
0 . 2097E+03 
0 .2059E+03 
0.2110E+03 
0,2156E+03 
0 .2166E+03 
0 .2051E+03 
0.2U5E+03 
0.2079E+03 
0.218QEt03 
0 .2215E+03 
0,2017E+03 
0.2093E+03 
0.2155E+03 
0.2123E+03 
0.21E0E+03 
0.2152Et03 
0.2081E+03 
0,2235E+03 
0.2061Et03 
0 .2119E+a3 
0.2208E+03 
0 .2098E+03 
0.2175E+03 
0.2211E+03 
0.2199E+03 
Q.2139E+03 
0.1988E+01 


0.5062E+00 

0.5716E+00 

0.5163E+00 

0.5226E+00 

0.5385E+00 

0.5027E+00 

0.5375E+00 

0.5152E+00 

0.5372E+00 

0.5121E+00 

0.5690E+00 

O.S709E+00 

0.5117E+00 

0.5957E+00 

0.5558E+00 

0.5161E+00 

fl.5533E+00 

0.5503E+00 

0.5S16E+00 

0.5683E+00 

0.5109E+00 

0.5720E+00 

0.5755E+00 

0.5225E+00 

0.5613E+00 

0.5572E+00 

0.5712E+00 

0.5116E+00 

0.2963E+00 


0.1338E+02 

0.7508E+03 

283 

0 . 2106E+01 

0 . 7181E+03 

2^9 

0 . 190SE+02 

0.7519E+a3 

275 

0 . 2009E+02 

0.7510E+03 

274 

0.1579E+00 

0.7527E+03 

272 

0.1255E+02 

0 .7168E+03 

240 

0 . 1165E+01 

0.7513E+03 

269 

0.6638E+01 

0.7522E+03 

253 

0.9339E+01 

0 .7191E+03 

265 

0.1113E+02 

0.7172E+03 

282 

0.1312E+02 

0.7525E+03 

252 

0.9705E+01 

0 .7166E+03 

260 

0 . 1156E+02 

0.7505E+03 

255 

0.1271E+02 

0 .7177E+03 

244 

0.1335E+02 

0.7512E+03 

263 

0.1021E+02 

0.7528E+03 

237 

0 . 1018E+02 

0.7180E+03 

254 

0.2610E+01 

0.7190E+05 

246 

0 . 1300E+02 

0.7176E+03 

281 

0.1312E+02 

0.7503E+03 

289 

0.3800E+01 

0 .7186E+03 

243 

0.1282E+01 

0.7515E+03 

247 

0.8790E+01 

0 .7181E+03 

288 

0.7371E+01 

0.7531E+03 

293 

0.7691E+01 

0.7529E+03 

292 

0.1310E+02 

0.7528E+03 

252 

0.5951E+01 

0.7188E+03 

268 

0.1579E+00 

0.7551E+03 

7126 

O.OOOOE+00 

0.3510E+05 

7126 

»*FILE  STATISTICS** 


FILE 

NUMBER 

LABEL/TYPE 

AVERAGE 

LENGTH 

1 

AWAIT 

0  .  0000 

2 

AWAIT 

0.0000 

3 

AWAIT 

0 . 0000 

4 

AWAIT 

0 . 0000 

5 

AWAIT 

0  .  0000 

6 

AWAIT 

0 . 0000 

7 

AWAIT 

0 . 0000 

8 

AWAIT 

0 . 0000 

'  9 

AWAIT 

0.0000 

10 

AWAIT 

0.0000 

11 

AWA I T 

0  .  0000 

12 

AWAIT 

0.0000 

13 

AWAIT 

0 . 0000 

14 

AWAIT 

0.0000 

15 

AWAIT 

0.0000 

16 

AWAIT 

0 . 0000 

:  17 

AWAIT 

0.0000 

STANDARD 

MAXIMUM 

CURRENT 

DEVIATION 

LENGTH 

LENGTH 

0.0000 

1 

0 

0.0001 

1 

0 

0.0001 

1 

0 

0.0000 

1 

0 

0.0001 

1 

0 

0.0001 

1 

0 

0.0000 

1 

0 

0.0000 

1 

0 

0.0001 

1 

0 

0.0001 

1 

0 

0.0000 

1 

0 

O.OOOl 

1 

0 

0.0000 

1 

0 

0 . 0000 

1 

0 

0.0000 

1 

0 

0.0001 

1 

0 

0.0001 

1 

0 

AVERAGE 
HAITING  TIME 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 
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18 

ANA  IT 

0 . 0000 

0.0001 

1 

0 

0 

19 

AWAIT 

0 . 0000 

0.0002 

1 

0 

0 

20 

AWAIT 

0 . 0000 

0.0000 

1 

0 

0 

21 

AWAIT 

0.0000 

0 . 0000 

1 

0 

0 

22 

AWAIT 

0 . 0000 

0.0000 

1 

0 

0 

23 

AWAIT 

0 .0000 

0.0000 

1 

0 

0 

24 

AWAIT 

0 . 0000 

0.0000 

1 

0 

0 

25 

AWAIT 

0 . 0000 

0.0000 

1 

0 

0 

26 

AWAIT 

0 . 0000 

0.0001 

1 

0 

0 

27 

AWAIT 

0 . 0000 

0.0000 

1 

0 

n 

23 

AWAIT 

967 . 0222 

521.9036 

1877 

1876 

386 

29 

AWAIT 

0 . 0000 

0.0000 

1 

0 

0 

30 

CALENDAR 

28 . 0000 

0.0031 

44 

28 

0 

ACTIVITY 

AVERAGE 

XXREGULAR  ACTIVITY  STATISTICS** 

STANDARD  MAXIMUM  CURRENT 

ENTITY 

IHDEX/LABEL 

UTILIZATION 

DEVIATION 

UTIL 

UTIL 

COUNT 

1 

1 . 0000 

0.0001 

1  1 

7126 

XXRESOURCE  STATISTICS** 


RESOURCE 

RESOURCE 

CURRENT 

AVERAGE 

NUMBER 

LABEL 

CAPACITY 

UTILIZATION 

1 

LINEl 

I 

0.0000 

2 

L1ME2 

1 

0.0000 

3 

LINES 

1 

0.0000 

4 

LIME'S 

1 

0.0000 

5 

LINES 

1 

O.QOOO 

6 

LINES 

1 

Q.QOOO 

7 

LINE? 

I 

0.0000 

8 

LINES 

1 

0.0000 

9 

LINES 

1 

0.0000 

10 

LINEIO 

1 

0 . 0000 

11 

LINEll 

1 

0.9000 

12 

LINE12 

1 

0.0000 

13 

LINE13 

1 

0.9000 

14 

LINEIA 

1 

0.9000 

15 

LINE15 

I 

0.0000 

16 

STAR 

1 

1.0900 

STANDARD 

MAXIMUM 

CURRENT 

DEVIATION 

UTILIZATION 

UTILIZATION 

O.OOU 

I 

0 

0.0005 

1 

0 

0.0007 

1 

0 

0.0008 

I 

0 

0.0008 

1 

0 

0.0005 

1 

0 

0 . 0009 

1 

0 

O.OD09 

1 

0 

0 . 0006 

1 

0 

0.0005 

1 

0 

0.0006 

1 

0 

0.0008 

1 

0 

0 . 0005 

1 

0 

0 . 0006 

1 

0 

0.0005 

1 

0 

0.0001 

I 

1 

RESOURCE 

NUMBER 

1 

,  2 
3 


RESOURCE 

LABEL 

LINE! 

LINE2 

LINES 


CURRENT 

AVAILABLE 

1 

1 

I 


AVERAGE 

AVAILABLE 

1.0000 

l.OOQO 

l.OOOO 


MINIMUM 

AVAILABLE 

0 

0 

0 


MAXIMUM 

AVAILABLE 

1 

1 

1 


0000 

0000 

0000 

0000 

0000 

0000 

0000 

0000 

0000 

0000 

6797 

0000 

7417 
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<5 

UHE<i 

1 

1.0000 

0 

1 

5 

LIMES 

1 

1.0000 

0 

I 

6 

LINES 

I 

1.0000 

0 

1 

7 

LINE7 

I 

1.0000 

0 

1 

8 

LIMES 

1 

1.0000 

0 

1 

9 

LIME9 

I 

1.0000 

0 

1 

10 

LIHEIO 

1 

1.0000 

0 

1 

u 

LIMEll 

1 

1.0000 

0 

1 

12 

LINEI2 

I 

1.0000 

0 

1 

13 

LINE13 

I 

1.0000 

0 

1 

19 

LINEH 

I 

1.0000 

0 

1 

15 

LIME15 

1 

1.0000 

0 

1 

16 

STAR 

0 

0.0000 

0 

1 
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